WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
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GLGAQIM*GQSFET 


11562 


25463 


A 


11662 


142 


437 


GQRVQIFFFFWKRFPFFAQGGTEGAQFR 
LIKPSPSGLKEIPPPNPPRKGE*RDGPP 
FRGD F VFLRKNG VP PCGQGGS KPPAPGE 
PPPFTPPRGENNRG 


11563 


25464 


A 


11663 


120 


353 


QALQVMLKHVNFFFFLFFLFWKQSFVPL 
PRLECNGATSVHCNLCLPG*RNFPASAQ 
EFRIAGSYDHAICPPASGKGE 


11564 


25465 


A 


11664 


1 


404 


NTWSDIj I ISYFHPRPS AFFLRQGLALS P 
RLE YRGT I TGHYS LDLWGS SDS PAS I S R 
VAGTT YRCM * FQLLGRLRRENHLS PRGR 
GCS DP* LCPYTP AW VTERD P VS KKKLRG 
GGENSLLSSQTMY 


11565 


25466 


A 


11665 


493 


3 


NIWIPDVLPGHLARQ*LGSPGKLSAGF* 
PLQFPQTLSSARAPPFCLPRLQVRNTVG 
FLPSPHTSRPSHCLPACPIiPSRTQPWVP 
VKPGPTACRGFLQHPPTPGSPSP*SFQG 
SSWWDTDLRALECLLRILRGRNCSVFVT 
TQGQLSHEGMNKEFLLTCAKKKK 


11566 


25467 


A 


11666 


3 


401 


CMERAVTVLLPGSATQSPPVYAPRALAR 
LWLTAAMMISGFIADYEA*SSRCSSACP 
AGD S L S Y YHS PAD S FS I MGS P VGAQD F C 
ADLGVS RANF I PTDTDMS TS PDLQWLVQ 
PALVYSVGPSETIAPHPLGVP 


11567 


25468 


A 


11667 


1 


446 


LiS CAKP PQRPLRHG 1 1 KI FNVFEE TRAN 
KHLIiGVS KLKPRGF PALFLS VS S FGQKK 
HVPQWLQHSHSHIIiVISLDFFFFFFFEK 
KFPFFPPGGRERDQTPFN*TPPRGG*RE 
SPPFPSRGGGTQGWAPHCPIIFRFFQKK 
GVFPLRAR 


11568 


25469 


A 


11668 


18 


403 


DPLQRPQYRGGLRSVGCDLQSPTPRGPS 
GRWPNPAQARGPGEAQAWAWPGGGPREA 
FSGQGRPPPLGLHPHRRKGAGPPGPMGP 
* HEGQGS C * GAS CRGGLACTAFPAASS P 
PIFGGLTPPNLNFPEE 


11569 


25470 


A 


11669 


157 


1 


PRPQGFVFVFFFFEMQFGRLECSGAISA 
H* NLHFLGS SNS PAS AS HV YAP L Y 


11570 


25471 


A 


11670 


396 


7 


EKVGQARDKS ACEGHS CADARS CLHLRV 
DQGGQHGSHGACPGRRLRGPDTHRQSCY 
Q CRCNRS PDRLQ PGRAGTP * CAE AAAE P 
DAAVHPGQPAPARLPPALPGRGLLPLPH 
PGGS QRVYCVRC I GS KLR 


11571 


25472 


A 


11671 


209 


3 


DDLGTHRPGKSQVRVTPREAGENPSAVL 
P C YLHL VP F V* E DE VKP EDS I P DMP GNE 
YAREFLAHAPLY 


11572 


25473 


A 


11672 


270 


101 


LS LNRWILGAYI I FFETESHS VAQAGVR 
*HNLGLLQSLPPGSRRPLAFKVAFLFIG 


11573 


25474 


A 


11673 


421 


592 


S S GTPGVI I KIP C VRSNHMKLPGP GMVA 
HTCNP S TLGGQGGR I TWS E * FKTRLGNM 
A 


11574 


25475 


A 


11674 


176 


6 


LTFVYMPCFCALSSLIMTYGIAM* FHFH 
SMTLFI IGVLS VALAI YQ*WLDVSRVSV 


11575 


25476 


A 


11675 


1 


418 


NT VCVCVCVCVCVC VERKAGHRLTDS FP 
I QAG PGAQARTL CLRE S TGGAL S S GL PA 
SRYQDPGRGALHPHCVWTPRGWNWRLNY 
P PARS PSREAVC YFRQTLKKWRS PRGQG 
GRILRVIRTASTFRDTS *TESACLGARP 
CLDGK*VCESVPCFSLHTHTHTHTHTHT 
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C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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HCI 


11576 


25477 


A 


11676 


305 


698 


SFMGRSPRKIDQFCNSRNMVHGSVTFRD 
VAIDFSQEEWECLQPDQRTLYRDVMLEN 
YSHLISLGSSISKPDVITLLEQEKEPWV 
WS KETSRWYPGK* E*S RQGEAI IVPDS 
PCAQRGVTPLRCGLENFLQ 


11577 


25478 


A 


11677 


107 


397 


GGDGRE TLRAP ADDG C I * QERAETPLAI 
SCPRSSST CRGRARR YDG CRTRDGHQL I 
QRGQQQESSRRTAQPGSSKLCRGPGGHR 
PSLGTVGVHGCSL 


11578 


25479 


A 


11678 


514 


554 


RAKPTPP*GGGEGQNFGLLKPPVSGLPP 
LACPTPPKRWKPKPPPPAPPHFLFF* *K 
RGLIF * PGEVFNLRNWKLPPS PPQRGGN 
KGRNQPPPRGTFMF 


11579 


25480 


A 


11679 


266 


3 


GPPLFLGLKN*TFFSPTKV* IFFSQVFS 
SFPQGLKGGVRE I FFFFFF* DRLECNGT 
I S AHCNLRLPGS SDSQAS AGTTGVCHYT 
RMY 


11580 


25481 


A 


11680 


1 


408 


NTCALRHRDHFS F WSDGKTKEGEER I VQ 
ENWL I S WNNFS VPETLLTIiS CPLGKEGM 
PGEDGTAGAGKVPGEDKI PG * DGTAGED 
GTEDENGTTGEDETAGVGKTPGAGGTTG 
EDDTES EDGTTGEDETAGKGGTAG 


11581 


25482 


A 


11681 


244 


420 


KDSESTKAYIRDNISSTKKKKSWPGAGA 
HP CNP S TLEGKGG * IT* SQEFETSLANM 
VKL 


11582 


25483 


A 


11682 


294 


443 


QMNTCTH I Q PAY VH * E KY FR P S AMAHAL 
NPSTLGGRGGWIT*SQEFQTRP 


11583 


25484 


A 


11683 


346 


1 


YNTNQFTLRGTQAS VYTCLTAL I VLKLL 
NQP YTLSAVLLNAGNQP FHAFP S PPSLH 
PDAFHSKIMSHIIILLLGFIPFLPASQS 
LTLS P SLERSGT I S AQCNLCL PGSSNS P 
ISA 


11584 


25485 


A 


11684 


140 


423 


SSHQASSPPTTASHSMKISVAAIPFFLL 
ITITLGTKTESSSRGPYHPSECCFTYTT 
YKIPAQRIMDYYETNSQCFKPGIVFITK 
RGHSRWTNPK 


11585 


25486 


A 


11685 


163 


3 


IKIPCNRLLKFGPPSCCGFFFPLLPPFF 
FFETLSRSVAQAGLQWRDLGSLLAL 


11586 


25487 


A 


11686 


179 


1 


PIVFLGMLFVAFAFRGPFESVTKPPNFF 
FF FE ME S CS VAQAGVQWRD LG S LQAR P P 
RST 


11587 


25488 


A 


11687 


192 


1 


EQGFDGHLASWISLKDTAKWKNGDELCS 
YYQSCCTDYTAECKPQVTRGDVFTMPED 
EYTVYDDG 


11588 


25489 


A 


11688 


89 


3 


SGGYCCCLCCRCSRYSCCCCRLREGPTK 


31589 


25490 


A 


11689 


168 


2 


QDRLQ PHE S KRKLGPAPQLRRS E S DTP S 
WNFQSTESQLMSKGDEDTKDDSKETV 


11590 


25491 


A 


11690 


301 


397 


HFVFS LVTGMNPLS P YLNVDPRYLVQVR 
LRFY 


11591 


25492 


A 


11691 


18 


421 


TKALQ I TCYLHS TMSEESDGKL I EDSL I 
QLRCHFTWKLL I E APE I PDLENR I WEE I 
Q FLDT KYNVG I HNLLAYVKHL KGQNEE A 
LVSLKKAEDLIQKEHANQADIRSLETWG 
NFAWVYYHMGRLAEAQTYLDKV 


11592 


25493 


A 


11692 


188 


3 


PLQGKGFGAKERGQKKGFLRGGVRASRP 
RPPFGPSLFFFFFFETDSHSVAQDGVQW 
CDLGS 
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N=Asparagine, P=ProIine, 
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T=Threonine, V=Valine, 
W=Tryptophan 7 Y=Tyrosine } 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11593 


25494 


A 


11693 


24 


391 


APRADAMGHFTEEDMATITSLWGQGNVE 
DAGGETLGRLLDVYPWTQRFFDS FGNLS 
SASAIMGNPKVKAHGKKGLTSLGDAIQH 
LDDLKGTFSQLSELHCDKLHVDPENFKP 
LGNGLQTALQ 


11594 


25495 


A 


11694 


138 


425 


NS GVKAAFQ LNP GNPNL EGKPGP GFKTH 
PGQLGETHFLLKIHYLSGLVGKPWESQIi 
LGNLNRENHWTPERGGPRDLRHCHGIQP 
WATNGNFVLKKK t 


11595 


25496 


A 


11695 


158 


838 


CGTVHSCDAGQRQRATAPSHPCDHGNQQ 
PILYRVLCCQLFWILCFVFSHSIMSTKC 
PSVSPVSGEMKKRKAITLEMKLKI IAQH 
EGGKP VMAI ARELGLWQS T I ST I LRDKK 
Q I S DAAKS S ASVKS T VI TKKRAGP I DDM 
EKLLVMWMEDQ I QKRI PLS LLM I QAKAR 
SLFWMLKDRASDPTYTQMFKASHGWFQR 
FKRRHNFHNVKITGEAARAGNEGAIAFK 
EQL 


11596 


25497 


A 


11696 


834 


1431 


SSACQGGSQGWPPRQPWWMWGWSRRAPS 
VTHRTGERCSTSGSAPLPLLLSSLCSSL 
PRSWEHSTLRQPPHPPLLPPGCLPPGRE 
AQRIiS GAGHGPGPR PAAAS YI PD TQADR 
KQQQHHPGPDQHVGHREGLALEDAATHL 
GLVAALSCWLLGRAEAGYQVPHGHDHPK 
DQHPQADGGQRIVRAIGLGLGHHVSGRR 
ARP 


11597 


25498 


A 


11697 


143 


1 


I S KERGASRFSGPWVFFFFFLESRS VAQ 
AGVQWCNLGSLQAPPPGFT 


11598 


25499 


A 


11698 


58 


459 


KGKE E KVKRKEAEQNFS PYAQDKQER I K 
GNENDE KT KQGKE T 1 1 D I E L FKGLD ETG 
ENMDS TLTRT PFE PLENNKQ I LVLGLDG 
AGKTS VLHS hASNRVQHS VAPTLGFHAV 
CINTEYSHMEFLE IGGSKPFRS 


11599 


25500 


A 


11699 


215 


1 


GTKKALWAGGGGFFPYLPPMGPLGPICG 
GVGKGS PGLGCP I FFFFFFFETES CS VA 
RLDAQWPDLGSLQSP 


11600 


25501 


A 


11700 


300 


420 


KPKI LFGNVFAAPHMENLKCRGETVAKE 
I S E AMKVKAM C 


11601 


25502 


A 


11701 


351 


466 


RIKNADLSQAQWFTPVIPALWEAKVGRS 
LEVRSSRPVW 


11602 


25503 


A 


11702 


172 


400 


SNRLRNKIQGVFLNDSS ISPFI LRKQS I 
GQAWWLTPVI PALWKAEASGS PEMDS TT 
LLPS S S QVPSLVKMEKLNYS 


11603 


25504 


A 


11703 


284 


408 


ASVFS S FFF VCLF VFEME SRS FAQAGVL 
WRDLGSLQAPPPG 


11604- 


25505 


A 


11704 


20 


447 


LPGADYGGGHLSLRLFHIiLLASAAWVPD 
ESQVTLNSAICVLSTVLIMEFPDLGKHC 
SEKTCKQLDFLPVKCDACKQDFCKDHFP 
YAAHKCPFAFQKDVHVPVCPLCNTPIPV 
KKGQ I PDWVGDH IDRDCDSHPGKKKE K 
I FA 


11605 


25506 


A 


11705 


1 


455 


HSCSLQFTPTAWDCTGSVSSEQGERPAA 
AMKI CSLTLLSFLLLAAQVLLVEGKKKV 
KNGLHS KVRSEQKDTLGNTQ I KQKSRPG 
NKGKFVTKDQANCRWAATEQEEGISLKD 
ECTQLDHEFSCVFAGNPTS CPKAQDERV 
YWKQVARKLRS 


11606 


25507 


A | 11706 


1 


428 


DAEADKMAAAAVRGGRSGGSGGCSGAGG 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASNCGTGSGRSGLLDKWKIDDKPVKIDK 
WDGSAVKNSLDDSAKKVLLEKYKYVENF 
GL IDGRLT I CTI S CFFAI VAL I WD YMHP 
FPESKPVLALCVISYFVMiyiGILTIYTSY 
RE 


11607 


25508 


A 


11707 


1 


422 


RSQARSSAAAAARASVPLRGSPGPSAIM 
PMFIVNTNVPRASVPDGFLSELTQQLAQ 
ATGKPPQYIAEHWPDQLMAFGGSSEPS 
ALCSLHS IGKIGGAQNRSYS KLLSGLLA 
ERDRISPDRVYINYYDMNAANVGWNNST 


11608 


25509 


A 


11709 


88 


423 


AADAMKQAEEAMGAFMRLAYQELQIDRL 
KEDKMLLNLEGNKREHAERLGMGLVSRS 
SVSHSVLSEMLVIEHETPVSAKSSRSQL 
DLFDDVGTFCSGPSKYTDNPFSLWESIG 


11609 


25510 


A 


11710 


393 


130 


NP S ATAQ S FHIiAARKL S TLDR P GS Q AP Q 
KP MP S P KP GWL PAS TNAV P T P LAE AT P 
SKAHPAISLLSTEEGIFKAVPSPASSCS 
FLHV 


11610 


25511 


A 


11711 


331 


440 


K I FFLNFL I SRAWWCAPLVLATWEAEVG 
ESLYPRRS 


11611 


25512 


A 


11712 


122 


478 


SGLCPQQPPRANSCPPSSMASCAEPSEP 
SAPLPAGVPPLEDFEVU3GVEDAQGEEE 
EDEEEEEEDDLSELPPLEDMGQPPAEEA 
EQPGALARE FLAAME PE PGPS PVPKEWL 
DILGNGL 


11612 


25513 


A 


11713 


102 


2 


TNLGNP RRP P P F F F F E T VS LLAQ AGVQW 
CDLGS 


11613 


25514 


A 


11714 


126 


2 


FFAPFLKIFFFFFFSEMESCSVLQAGVQ 
WHDLGSLQNEGSQ 


11614 


25515 


A 


11715 


178 


449 


LSGGNLFGI I ILLYCAI IGGKLLGLIKL 
PTLPPLPSLLGMLLAGILITNIPVINDN 
VQ I MHRW SYS LRS I ALA I 1 L VRAGL VLE 
SKALEK 


11615 


25516 


A 


11716 


48 


417 


GSGGNHSVCCDTMEGGGGSGDKTTGVLA 
G F FGAGE AG YSHAD LAGVPLTGMN P LS P 
YLNVDPR YLVQDTDE F I LPTGANKTWGR 
FELAFFTIGGCCMTGAAFGAMNGLRLGL 
KETQNMAWSKP 


11616 


25517 


A 


11717 


103 


2 


PKSPTQWLMAVIPALWEAKAGGSRESRS 
SRPAL 


11617 


25518 


A 


11718 


1 


413 


WPSGQVLVGCLSFSLYCWKRSLKRKNPG 
FEVLLKI FLKNHPRCHTNRMIQLTATPV 
S ALADE PAH I RATGL I PFQMVS FQASLE 
DENGDMFYSQAHYRANEFAEADLNHAAS 
LGGDYLGDLALRLLCTLEPVFPQSY 


11618 


25519 


A 


11720 


116 


462 


AGMLPAVGSVDEEEDPAEEDCPELVPIE 
TTQSEEEEKSGLGAKIPVTIITGYLGAG 
KTTLLNY I LTEQHS KRVAVI LNE S GEGS 
ALE KS LAVS QGGE L YE E WLE LRNG CLC C 
SVK 


11619 


25520 


A 


11721 


167 


407 


EIYSLTRFIEVKMSKKISGGTWEMLGD 
EMTRIIRETLKEKLIFPYVESHLHSYDL 
G I ENRDATND QAT KDALE AFNKP Y 


11620 


25521 


A 


11722 


254 


423 


NQLSS IMAMFKKI KS FEWFND P E KVYG 
SGEKVAGRVI VEVCE VTRVKAVR ILACG 


11621 


25522 


A 


11723 


3 


424 


VS CDTMEGGGGSGDKTTGGLAGFFGAGG 
AGYSHADI^GVPLTGMNPLCPYLNVDPR 
YLEQDTDEF I LPTGANKTRGKFELALFT 
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N=Asparagine, P=Pro!ine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IGRCCMTGAAFGAMNGLRVGLKETQNMA 
WSKPRNVPILDMVTRQGALWANTLGALA 


11622 


25523 


A 


11724 


2 


343 


AFGTMKWVTLVS VLFLFS S AYSRGVFRR 
DAHKSEVAHRFKDLGEENFKALVMIAFA 
QDLQQCPFEDDAFSTSEVPEFATTCQDD 
DHFEDRCRRVHTLVWQLCTPATLLETD 
Y 


11623 


25524 


A 


11725 


1 


359 


HAFGTMKWVTVI SLVFLFNSAYSRGVFR 
RDAHKSEVAHRFKDLGEENFKALVLIAF 
AHYLHQ G P FEDHVKLVND VTE FAKT CDA 
DESAENCDKSLHTLFGDKLCTVATLQET 
YGEMADC 


11624 


25525 


A 


11726 


1 


349 


GAMSDRKFSAPRHGSLGFLPRKRSIRHR 
GKAKS FPKDDPS KPGHLTGFLGYKAGMT 
HIVREVYRPGSKANTNEVAEAVTIVETP 
PMEVAD I AG YMETPRGLRTFNTAFAEHM 
SDEC 


11625 


25526 


A 


11727 


81 


349 


TKGSWSCCVCLCVCLCTWSHLCRLVTW 
LPDMPDDVLWLQWVTSQVFTRVLMCLLP 
ASRSQMPVSSQQASPCTPEQDWPCWTPC 
SPEGC 


11626 


25527 


A 


11728 


264 


388 


QADPKDIMKFFGPLENQRLS FLLEKAI T 
REAQMWKVNVRKM 


11627 


25528 


A 


11729 


2 


471 


PGCSASWSKRGSGPDMLSSMAAAGSVKA 
ALQVAE VLE AI VS CC VGPEGRQVLCTKP 
TGEVLLSRNGGRLLEALHLEHPIARMIV 
D CVS SHLKKTGDGAKTFI I FLCHL.LRGL 
HAITDREKDPLMCENIQTHGRHWKNCSR 
WKFISQALLTFQTQIL 


11628 


25529 


A 


11730 


160 


377 


LQGFGRPSVYHAAIVIFLEFFAWGLLTT 
PMLTVS IAELGLCFVRERDKFLGTYHCV 
CLDTCLGVAI/DSDIiKQ 


11629 


25530 


A 


11731 


45 


438 


KIiGQRKE AT VTKE S CKS CSRKVGS PDRF 
RSPQKRSKGRQDCFTLFCFWKVLDKNME 
LISPTVI I ILGCLALFLLLQRKNLRRPP 
CIKGWIPWIGVGFEFGKAPLEFIEKARI 
KVCGRGRRGLQRRQCFLF 


11630 


25531 


A 


11732 


169 


292 


DS ILLVNLACS AVI S AHCNLRLPGSSDS 
PAS AS RAAGGAHL 


11631 


25532 


A 


11733 


473 


600 


KFWE KRTEGRGKS KS KNSETGE I VS IS A 
LS TTEVAMHTSTSS 


11632 


25533 


A 


11734 


19 


349 


AP S PDAMGHFTE E D KAT ITS LWGT VNVE 
DAGGETLGRLLGDYPWTQRFFDS FGNLT 
SASAIMGNPKVKAHGTKVLTSLGDAIKH 
LDDLKGTFAQLSELHCDKLLVDPENF 


11633 


25534 


A 


11735 


234 


359 


F YHLKS FTVS S VQS RWLTPVI PALWEAE 
AGGSPEVKSSRPAS 


. 11634 


25535 


A 


11736 


15 


372 


KLPLKALTGEEKTHINIDI IGHVHSVKS 
TTTGHLIYKSRGIDERTIEIFEKEAAEM 
GKGS FKYAC I LDKLKAERERGI TIDISM 
RKFETSKYYVTIIDAPGHRDFIKDMTTG 
TSHADCA 


11635 


25536 


A 


11737 


1 


357 


IWKAAMASPAGSWARPPRPMREPQTLAM 
PTNAAEDQKLKLERLMKS PDLAVT I PEK 
MSEWSPGPPPEFDRDVMGSRAGAASGEF 
HVYRLLRRRE YQRQDYMDAMAEKR I LDA 
EFQRRLE 


11636 


25537 


A 


11738 


24 


335 


APNANAMGHYTEEDKAT I TS P WGKGNVE 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














DAGGKTLGRLLDVYPWTHRFFDRFGNLS 
S D S AI MGNP TD KAHGQ KVLT S LGDATKH 
LDDLKGTFAQLSELHCDKLH 


11637 


25538 


A 


11739 


141 


335 


MQFLPCIPILKSLEKSVASHSQTVHSDI 
I S TVE FNHTGELLS TGDKGGR WI FQRE 
QESKNQVHR 


11638 


25539 


A 


11740 


7 


337 


APS PDAMGHFAEEDKAT I TS L WCKVNVE 
DAGGE TLGRL LAV Y PWTQR F FDS FGNL T 
SDSAIMGNRKVNAHGTKVLTSLGDAIKH 
LDDLKGTFAQLTELHCDKLHVDPENF 


11639 


25540 


A 


11741 


182 


360 


SRHS I S PPVNQ I QLGAS VTEE LTWTKT 
ARVSRAQWLMPVI PALWEAEAGESPEVR 
NSR 


11640 


25541 


A 


11742 


174 


1 


HFSDHS LFVCVRQDL.TLLPRLECSS L I T 
AHC S IiHLLGS GD P P TAAS W VS GNTG VH Y 
HA 


11641 


25542 


A 


11743 


75 


218 


KI I LGRAQWLMTV I PALWE AKVGRS P VS 
ASQSAWDYRREPPCPSCTI 


11642 


25543 


A 


11744 


168 


2 


LAPLWSLGPVLGGVGQGSPGPGFPFFFF 
FFFFETESCSVARLECSGPILAHCSLR 


11643 


25544 


A 


11745 


83 


2 


REQRFLLP FVQLC PAPRGGVYRGRQAS 


11644 


25545 


A 


11746 


156 


3 


FHSGSGRVENPS F F FS FFETGS CS VTQD 
GECTGATLAHCDLCFLGSSNSS 


11645 


25546 


A 


11747 


244 


330 


KDRAQWVTPVI PALWAAKAGRSLEVRSS 
R 


11646 


25547 


A 


11748 


201 


1 


TSQPLIRITLSTFFFFSRDGGLTMLPRL 
DPELPGSSNPPSCSASRVDGTTGMCHHT 
RLIFFSQTCKK 


11647 • 


25548 


A 


11749 


262 


3 


VYTSLTPPP I SVNIiLTQVS PRSVSYEQN 
ILCLLFVTYFTCTFFVCIiFVCLFVCFFK 
MES CS VAQAGVRWRDLGLLQAPPPGFTY 
TT 


11648 


25549 


A 


11750 


225 


3 


PIXNFSVPQFLHLYNGIIIESTPPSCCI 
LKCQSLGQAQWFTPVITTLWEAEAGRSL 
E AWTLKT I LANMAKPHL 


11649 


25550 


A 


11751 


185 


2 


VSTPFNSFPSPPWDLVFLKGFFFFFFFF 
FFETEFCSSPRLERNGAPIiAHCNLRLPR 
SSNSP 


11650 


25551 


A 


11752 


261 


336 


DRVSLWSPRLECNGAISSHCNLRLP 


11651 


25552 


A 


11753 


282 


381 


TDLLYKKKCGLGAVAHAYNPSALGGQGG 
RITCS 


11652 


25553 


A 


11754 


202 


45 


NPRKVKLQWALILPLPFNLGGQIiKSRFQ 
KKKKKKKKKKKE E KR KKKTLVVI C 


11653 


25554 


A 


11755 


293 


357 


LTPVI PALWEAEVGGSPEVRS 


11654 


25555 


A 


11756 


264 


1 


TLPINWCCDHSTGPSLISLPLIiSLPYS 
LRHRNIEIRLINNPTTACKYSCLKKSHK 
SLTLDQKLEMIKLGSEGLLKAQIGQKLG 
LLHQ 


11655 


25556 


A 


11757 


126 


1 


ILPGFLKELKIELPFNPAILPLGIYPKE 
KKLLYQKDTCTHMF 


11656 


25557 


A 


11758 


399 


226 


I EKGTGRVWWLTPM I PALSEAKAGGS PE 
VRSPRPACIGLLKFWYYRCEPQHPARLS 
EI 


11657 


25558 


A 


11759 


235 


380 


DTTVLKLGQLITLQWASKCSSERKSRTS 
LTLTQKLEMVKLSEEGMSKGD 


11658 


25559 


A 


11760 


249 


357 


NRASGQTWCPTP IMPALWEVEAGGSLEP 
RSLRLTWA 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M= Methionine, 
N=Asparagine, P~ Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=VaIine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11659 


25560 


A 


11761 


168 


372 


KLHCLLCFYLSEFFVFCFLETRSLSVAQ 
SKCSGTVITHCSLKLLCSSDPATSDSRV 
AETTGTGYHAGL 


11660 


25561 


A 


11762 


303 


377 


QFAGCRWLMPVT PALWEAKADGSPE - 


11661 


25562 


A 


11763 


135 


1 


ATTPGLFFIFYFFETESHS IAQAGVQCM 
I S VHCNLCLPGS SDS PV 


11662 


25563 


A 


11764 


219 


2 


KLRSSQLSEPEVRPDSTLSMLMHARAHT 
HTHTHPHLFLPHPCSRPHTSTPTCMPIV 

THSPIYLKFHSHTYIL 


11663 


25564 


A 


11765 


102 


1 


NRPINNPTMASMCSSERKSHKSLTLNQK 
LEMIKL 


11664 


25565 


A 


11766 


245 


3 


PGEPEEDQLVKFRPSLLWVFMFLFFLRR 
SLALS PRLE CS GVE CSGVI SAHCNL CLT 
GFKQFSCIiSLLSGWDYRHVPPCPS 


11665 


25566 


A 


11767 


350 


97 


GSVPASPQVTRPTLMTSLADKWFSLCLT 
SLRVNFGVALILSGSHSVSQDTLDLLTL 
FSTHLGLPKCWDYKRVPPCSAQFLVTFL 


11666 


25567 


A 


11768 


188 


391 


LGSVAGDLLCFFGGVIFPCSFMFPMSLC 
YYLY I WYNS CVFLFFKTE PHF VTHSG VH 
WCDGLLQPPPPN 


11667 


25568 


A 


11769 


131 


2 


YTLYIPNKSPIENIQLSWAQWLVPVIPA 
LCGAE TGELLE PRS S 


11668 


25569 


A 


11770 


165 


1 


VLIPLIHLCIYIYIYTYICVYIYTRIYT 
HICIHTHVYIHIYVYIHTYIYTYMYTY 


11669 


25570 


A 


11771 


72 


1 


ILLFFFFFEAESCSVAQAGMQWCN 


11670 


25571 


A 


11772 


121 


3 


TSFVLFCFETEFCSCCCPGLECNGAILA 
HCNLRLPGSSD 


11671 


25572 


A 


11773 


152 


3 


HVCLNLTLPFLFEKNIFSLCVCVCVCVC 
VCWL C KKKKCE KD F YFE RT F 


11672 


25573 


A 


11774 


190 


2 


GFSPRQRGAPRVPPPIoAGPFPPIFFFFF 
F FEMW S H S VAP AG VQ WCNHS S LAALT S P 
DSGDPP 


11673 


25574 


A 


11775 


287 


1 


GAHRRKLFITPGESLRDKQIAGFEHRGG 
EKKKTLFYKKKKKRKKKRKEKKGKEKKM 
LPKCNYTMKNHQVPYSIRPTRECADLRV 
FYFIKFQILKY 


11674 


25575 


A 


11776 


121 


3 


KCASRDLSKFFFFFFLETESRSVAQAGV 
QWCDLGSLQAL 


11675 


25576 


A 


11777 


142 


1 


E KTLHVRNT IHNS EGS LVI KIHGRLGW 
AHACNP S T LGGRGGQ I TRS 


11676 


25577 


A 


11778 


179 


3 


SHQPVPGTLDLPRGPQKLQSTSEAESEA 
S MS E AS S E DLVP P LEAGAAP YRE E EEAA 

KK 


11677 


25578 


A 


11779 


147 


6 


KTPGLKKNSNFFFFFFFFETRSHSIARA 
GVRGCDLSSLQPPPLGLK 


11678 


25579 


A 


11780 


184 


3 


GFGFYI I PNYRLFFFSLLIGPFFFLIFF 
ETEPCSVARLECSGVISAHCNIRLLGSC 
DSPA 


11679 


25580 


A 


11781 


120 


319 


VKSLAKFLVNYQLHRQTCSGQAGRGALR 
QRFGRPRQVDHLRPGVRDQPGQHGETPS 
LLEVRKLSSG 


11680 


25581 


A 


11782 


91 


3 


Y I CL S L I YTHTHTHTHTHTHTH I Y I Y VC 
V 


11681 


25582 


A 


11783 


221 


332 


S RLGMVDHPRS GVRDHPGQHGE TPS LL K 
IQKLARRSG 


11682 


25583 


A 


11784 


243 


379 


LKYS VP I KNCKPWPGTWDHTRNP STLGG 
QGGWITLDQKFETSLiAN 
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NO: of 
nucleotide 
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SEQ ID 
NO: of 
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M 
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od 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Giycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


11683 


25584 


A 


11785 


327 


5 


GRVDSQPIYPKNPPGSFPKIYPGSFKTL 
FPYLQSRHS FS I PRS KLARFHQAKPLF C 
S KPS QS LiS PHS ELKKKDLS QARWLTP VI 
RALWESKAGRSPEVRSSRPTWPT 


11684 


25585 


A 


11786 


245 


359 


FKGMDRLRSGVRDQPGQHSKTPSVLKIQ 
KLAGHGGMRL 


11685 


25586' 


A 


11787 


356 


67 


RDIASITARLRYDRGVGITRLEVLSKSD 
EYTKGSVLDFSRETEPIGYIYIYIYIYI 
YIYIYLHSLPLNHMEISLGHPPLSNNKF 
VQNFDSPKLSYH 


11686 


25587 


A 


11788 


103 


3 


KYIIPVCIFFFFETESCSVTQSGVQWCE 
LRRRG 


11687 


25588 


A 


11789 


136 


1 


VHQFTIRILNYYYYYFFETESYSVTQAG 
VRWCNLGSLQPLPPGFK 


11688 


25589 


A 


11790 


291 


163 


S LLLL PRLE CNS T I S AHRNPRLLGS SD S 
PASGRSLEPRSSKLQ 


11689 


25590 


A 


11791 


107 


? 


I FMYVFFETESRSVAQAGVQWRYLGSVQ 
APPSGFS 


11690 


25591 


A 


11792 


329 


3 


KNPRN 1 1 GKSRGPY I TNFPGDKGP KNS S 
PRGNDLS INKNWAQAFQPRGQKKLFFPK 
KKKKKE TQ S RS I P RL E CNGPT I AQ CNL K 
LLASSNPPTSASHSAAIISMSHSG 


U69\ 


25592 


A 


11793 


213 


3 


I S PWLFLLLQATLFRS QYPGPFPPFSLS 
PYLFPFKQKLGSGFFFFFLFFFKTESRF 
LARLECSGI ITAQ 


11692 


25593 


A 


11794 


1 


400 


KRAAPQPAPEQRDLKKKKKKPPLAPSSL 
FRFWEKLLPSFQFRQPRGPFLGTRGVSF 
AFHRRRF WQ YGNMGE KQ TWGNP GS S SN P 
PPGTAGSLSGNRGPGWGGKIADRFWLYA 
RDS FGHS RGLKGS CCQAHLGK 


11693 


25594 


A 


11795 


3 


769 


RKEQTRNARAEVLRQAKANFE KEERRKE 
LKRLRGEDTWMLPDVNERIEQFSQEHSV 
KKKKKKD KHS KKAKKE KKKKS KKQ KYE K 
NMESSDSSSSSEDEWVSAVPSQTPDKEK 
AWKVKDEKSGKDDTQIIKRDEWMTVDFM 
S V KT VS SSSLt KAE KE TMRK I E Q E KNQ AL 
EQSME I FQS KLEDAEKAASTKEDYRRER 
WRKPTYSDKAQNCQESRESDLVKYGFCS 
RDRYATTDTAKNSNNEKFIGDE KDKRPG 
SLE 


11694 


25595 


A 


11796 


110 


13 


HTHTHTHTHKHTHTHTHTHKDRSGKIKC 
VPTV 


11695 


25596 


A 


11797 


184 


2 


SE I FGKFHVFYILTVHFMPPTQVYVERD 
VLGQEQLLTPVIPAIiWEAKAGRSPEVRS 
SRPA 


11696 


25597 


A 


11798 


83 


387 


GERRRRRRRRLWAPLQKKKKKKKKKKKK 

RQKNIPPLKGGELKRAPAGDFKNPGRGK 
IARGGFFEKNLSWGGEK 


11697 


25598 


A 


11799. 


153 


2 


IPWEDTTVYSGYPQFLFSHSFFYFYFFE 
TGS CSLTQTGVQWCNHGSLQPR 


11698 ! 


25599 


A 


11800 


93 


3 


VWGRAWWLTS VISALWEAKVGES PEVRS 
S 


11699 


25600 


A 


11801 


115 


1 


NPLFFFFFFFEMESCSVAQAGVQWCNLG 
SLQPLPPRFK 


11700 


25601 


A 


11802 


295 


137 


RCFIFILFINKLYFVYLFIYFFEMESHT 
VAQAGVQWRDLGSLQAP PPNKLYF 


11701 


25602 


A 


11803 


287 


1 


MGPHAPFKGTCGGFQDFRFKNLEISAGL 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














THS P L LiAQNFRVGAVP S PR YS P R FG S KG 
GGGFFFFFFFFETESHSVAQQAGVQWCD 
LGSLQALPPGF 


11702 


25603 


A 


11804 


202 


3 


WS VCCFKQGLGLS PRLECNGAPGFKQS F 
P LS LL S NWGYRG P P PRLNF F F F F FEME S 
CSLAQAGVQR 


11703 


25604 


A 


11805 


3 


315 


FMLL I LTLFLLRNDRLVQCDVRS SWCL 
GFLLGWSVILYPLMAAFMPTMWIFLFIL 
FSMLFF Y VFFLFLLFFLLFLYS FPLFCF 
FFCFYLFRFFFFFLI FFSS P 


11704 


25605 


A 


11806 


126 


3 


KRGFFFFFFFFFFETESRSFAQAGVQWC 
DLGSLQALPPGV 


11705 


25606 


A 


11807 


130 


2 


QS I KMTCS SLYFF I KKNVAWWLTP WPA 
LWEAGAGGSHEPKS 


11706 


25607 


A 


11808 


149 


1 


GGKKAILFFFFSLSPRLECSGAIPTHCK 
LCLPGSRHSPASAFRWGTAG 


11707 


25608 


A 


11809 


258 


3 


KYYYQGNRQIKFVNAKSNKNLPSYSQKA 
SWFSSRNI RMMGQHQ PMHLDNPTS FIKK 
LGQAW WLRP VT PALWEAEAGGS LESRSL 


11708 


25609 


A 


11810 


153 


285 


CASP IRSHQKLNKNGWVQWLMPVI PVPW 
EAETGGSLQLRSSRLA 


11709 


25610 


A 


11811 


160 


3 


NHLFLLLNRYKS I F IGRVQWLMP VSSAF 
WEAKAGRS FEPRS S RPAGQHGKTL 


11710 


25611 


A 


11812 


89 


2 


AFFFFFETVSCSVAQGGVQLCHLHSLQL 
N 


11711 


25612 


A 


11813 


254 


336 


LLGQPQWLTPVIPALWEAEAGRPSEVR 


11712 


25613 


A 


11814 


108 


2 


C VGVLG F F VCL FFETESCS VAQ AGMQWH 
DLLGSLH 


11713 


25614 


A 


11815 


166 


289 


SHSGWNAVWRDGSLQPLTPGFKRFPCLR 
VPSSWDHKCAPPC 


11714 


25615 


A 


11816 


192 


2 


DKDPGKVTRP FLLLC I FYLF I CLFLRRT 
LAVSPRLEFSGRGCSEPRSHRCTPSWVT 
EQDSCRK 


11715 


25616 


A 


11817 


228 


343 


LLQYS IQNEDAGWAWWLTPVI PTLWEAK 
AGRSLEVRSL 


11716 


25617 


A 


11818 


322 


407 


VLRMLLHCLRECKLVQPLWKTVWQFLKD 


11717 


25618 


A 


11819 


119 


1 


WEKIVRGKKCQKENTHTHTHTHTHTHTH 
THTYRESKRERLV 


11718 


25619 


A 


11820 


269 


1 


FVQVFY I LWTQS FCLTCCQWFLLGYHLS 
SVSNRFMRSGKSTFFFMTSEFFFFEMES 
RCVTQARVIiECSGS I SAHCKLHLPGPRH 
SPALV 


11719 


25620 


A 


11821 


217 4 


389 


EH I ET VYGKLG I S EHWGKGGL FNKWLT I 
WKKD YS WQWWLMPV I PALWEAE VGGSRV 
H 


11720 


25621 


A 


11822 


188 


1 


GSHMPCRVISSVHESMNEFPAVPTSYPA 
NP Q PRE RAWRNQRE KE D KKE RS QRS VGR 
VQAGLV 


11721 


25622 


A 


11824 


126 


3 


KLQGQVQWLTPVI PAPWEAEAGRS PEVR 
SSRPASTWRNLV 


11722 


25623 


A 


11825 


224 


3 


AL I IKKEVSAMELEGSNRIQAYGVQS IT 
GARDYHAAAS RVPAI KGTHHHARV I FVF 
LVETGFHHVGQAGLDS C 


11723 


25624 


A 


11826 


3 


364 


HELPEPLRVLWTAHRVAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQAHRAHQIiAIDTYHEFEETYI PKDQK 
YSFLHDSQTYFCFSDSIPTPCNMEETHQ 
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Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=GIutamine, R=Arginine, S=Serine, 
^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, -—Stop codon, /^possible 
nucleotide deletion, V— possible 
nucleotide insertion 














KSNLELLR 


11724 


25625 


. A 


11827 


2 


376 


ARE L P E P I T VLRTAHLKAMAP GS RT YLL 
LAFALLCLPWLQEAGAAQTDTIiYMLFDH 
AMLQAHRAHQLAIDTYHEYDETYIPKDH 
KYS I LHDYQTS FCFSNS I TTP YNKEETQ 
QKSNLELLRISLL 


11725 


25626 


A 


11828 


288 


3 


IHTKNPSVHHHHQRPKVDKTTKMGKKQS 
RKTGNS KKWSAS P P PKEHS S S PATEQSW 
TENDFDELREEGFRRSNYSELQEEIQTK 
GKEVENFEKN 


11726 


25627 


A 


11829 


107 


1 


KRSGVRLQRGLVRWLTPVI PTLWEAEVG 
GSFEVRC 


11727 


25628 


A 


11830 


220 


354 


QGNKS FKNI I AP WLGTVAHACNPSTLGG 
QGGR I TGHE I ET I LANM 


11728 


25629 


A 


11831 


273 


352 


ENLKTGQIQWLTPVIPALWEAKAGRS 


11729 


25630 


A 


11832 


239 


488 


SQTHCSGKTENSHGTNSDRNQVPTALHK 
RPSSPARFLTEAVPVPPVDYLQSVAAFA 
VSAVASQWERTGKPFNPLLGETYELIR 


11730 


25631 


A 


11833 


189 


294 


DQTRWLT P VI PALWGAKASGS P E VRS S L 
HFPLLF 


11731 


25632 


A 


11834 


1 


332 


GTSPEPLTVLWTGDLLAMAPGSRTSLIiIi 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQDHRAHHLAIDTYHEFEETYIPKDQK 
YSFLHDSQTFFCFSDSIPTPSNMEET 


11732 


25633 


A 


11835 


330 


175 


PRELAQLGSPNVRVSRVKPRGQPQMFFF 
FFWRQE S CS VAQAGVQWHE PEQQE 


11733 


25634 


A 


11836 


96 


1 


VFVETGSRSVAQDGGQWCDLSSLQPQPP 
RPRA 


11734 


25635 


A 


11837 


267 


334 


SWVQWLTPVIBALWEAEAGGSP 


11735 


25636 


A 


11838 


180 


2 


SVLEKKEKNLYKNLFTYKLLKKVQGKTQ 
CEGRAQ I CSVCVCVCVSVCTCVHVCAYV 
SSC 


11736 


25637 


A 


11839 


183 


2 


AAPLTSSAPQAPGWKPHLAPNSPFKNFT 
GRGQAQWLMPVI PALWEAKVGRSPEVRS 
LiSSC 


11737 


25638 


A 


11840 


54 


330 


DPNGQLPEPLKGLWTAHLVAMAPGSRTS 
IxLLAFGLIjCIj P WLQ E AGAVQ T VP lsrl f 
DHAMLQAHRAHQLAIDTYHEFEEPYIPQ 
DQKNSFLD 


11738 


25639 


A 


11841 


117 


344 


IQYVKQINETEMRNNESYLNHTSLTITI 
HTLCLMGSYLEHFQNQCKGRARRLMPVI 
P ALWE AKAGRS PD VRS S KP A 


11739 


25640 


A 


11842 


750 


968 


RAQGPRWILKIPFSSPFRLFPLIPLVFL 
YNSPPLFPPFPRTNATNNSPFPPPLPPP 
LPFSPPPPPSPSRPPPS 


11740 


25641 


A 


11843 


757 


1000 


WELVWYCCLFSPFFPLPSPPLFPPPPP 
SPPHSFPPPSPFRPPLPLAPTRPPPPFS 
LFQAPPLLPSFCSPPPPPPPSLSFL 


11741 


25642 


A 


11844 


278 


361 


TLLFFFFFFEMESHFVARAEVQWRDLGS 


11742 


25643 


A 


11845 


260 


2 


GS CLIiEGKLTNRKD IHTKNPS VHHHHQR 
PKVDKTTKMGKKQS RKTGNS KKQS TS P P 
PKERSSSPATEQSWMENDFDELREEGFT 
RA 


11743 


25644 


A 


11846 


194 


2 


TKFAKEPSPPPCWQEVFKTSPLGFFFCP 
RS F P VNG F PRLKNL F F FFE TE S C S VAQA 
GVQWCARA 


11744 


25645 


A 


11847 


219 


1 


SLGHTLKQSESSLSFSPGGTPLLEFKPS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














VYDPSVPPFQVSKGGCVGFFFFFQTESC ~ 
SVAQAGVQWHDLGSPRA 


11745 


25646 


A 


11848 


126 


3 


RFFFKGLFLHFFFFFFFFCFETESHSVA 
QAGVQWCDLGLV 


11746 


25647 


A 


11849 


216 


339 


KCFKVGKVYKLNRETFYLAQNFFDRNWA 
TQKMWKTLLELI 


11747 


25648 


A 


11850 


190 


334 


VSYHIEMSFFEVYNEKIHDLLVCKDENG 
QRKQPVRLKQFI ICFEFLFL 


11748 


25649 


A 


11851 


198 


352 


SRQGTWGIHGLECFFVCLFVLFFETESH 
F VPRLECSGAI SAHCTLHLPGEL 


11749 


25650 


A 


11852 


371 


1 


P P KKLG I Q VFT P FAPRAF WP VL VF KKKG 
FPQGNLVLPGFFSKVSIQGVPPFCRKPF 
SGWGEAFQIGFFINRGKKFFFFPLGFFIi 
KKVFLKFLAKFFFFFFFFFETESHSVAQ 
AGVQWRSLGLV 


11750 


25651 


A 


11853 


176 


3 


KKGFPLDWNSMVKNIIVKFGEDLQISKT 
LHTFFFLTESRSVAQAGMQWCDLGSLHS 
C 


11751 


25652 


A 


11854 


281 


375 


QLTFKKYFLGWARWLTPVIPALWEAKAG 
RSP 


11752 


25653 


A 


11855 


146 


356 


KCGALIAEIEVPLFSELRDFLLAYSRMT 
E L C F KRC VP S LHHRALDAVDDD CLHS C S 
CTDYP INRSLMNAY 


11753 


25654 


A 


11856 


136 


1 


LS ILCEFFNWLIEMEYRSVSQAGVQWCD 
LSSLKPPPPRLQQCHSC 


11754 


25655 


A 


11857 


277 


361 


I VLRGAMWLTPVI PTLWEAEEGGSPEVR 


11755 


25656 


A 


11858 


295 


152 


WFGFGFFETESHFVAQPGVQWCNLGSL 
CSLRLPGSSNSPASAFQVA 


11756 


25657 


A 


11859 


1 


342 


GTRLPEPLTVLWTAHLGAMAPGSRTSLL 
LAFALIiCLPWLQEAAAVQTAPVSRLFDH 
AMLQAHRAHQLAIDTYQESEETYIPKDH 
KYSLLHDSQTCFRFSDS I PTPYNMEDTQ 
HK 


11757 


25658 


A 


11860 


244 


332 


TI EMMLDIKQ I QVI FLFEFKMGRKIAET 
T 


11758 


25659 


A 


11861 


1 


339 


GTRWT I CQVQLHAYAHFLYFFEMEARS 
VAQAGWWCDLGSLQPPPPGSSSSSSSS 
S 


11759 


25660 


A 


11862 


311 


394 


GMLGAVAHACNPS TLGGLGGW I TCSQE F 


11760 


25661 


A 


11863 


130 


1 


VPPSVRTFFFFFFETESRSVSQAGVQWH 
YLGLLQAPPPGSTSC 


11761 


25662 


A 


11864 


3 


320 


IMM YALFLLS VGLVMGFVGFS S KP S P I Y 
GGLVLI VSGWGCVI ILNFGGGYMGLI V 
FL I YLGGMMWFG YT TAMAI E E Y P E AWG 
SGVEVLVSVLVGLAMEVGLVLW 


11762 


25663 


A 


11866 


165 


309 


GL I LL PMLE CSDR I S AHCS L YLLGS YDP 
SDSGS HVAGTTGT CH YAWLR 


11763 


25664 


A 


11867 


208 


314 


GSRNKLSGQAQWLT P VI PALWE AE VGGS 
PEIRSSR 


11764 


25665 


A 


11868 


102 


1 


KKFFFFFETESHSVAQARVQCNLGSLQA 
PPPRFT 


11765 


25666 


A 


11869 


126 


3 


NNALQLHSSYCKKPFFFFLKTESRSLTK 
AGVQWCDLGLLQ 


11766 


25667 


A 


11870 


1 


273 


KQLPVNFLNWVRLELTGLLCTLNLSKPC 
MIFIILVIVKYWFLLFCNIFKNHIFSQA 
QWPT P VI PALWE ADMGGSHG YW I TT I VD 
FMCATLT 
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nucleotide 
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residue of 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJutamine > R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11767 


25668 


A 


11871 


126 


2 


FPKALLWFFFFFFFFETESRSVAQARV 
QWHNPGSLQCPLR 


11768 


25669 


A 


11873 


269 


9 


QVSGKFWPDIFYKTGEKGLQCPMFPQGH 
P PRGLKKKS FS KKKKKKKKKS I AVGQAQ 
WLIPVIPALWEAKVGRSPEVKSSRPACP 
TW 


11769 


25670 


A 


11874 


114 


1 


LEAKPGVFQSGVLLVIFFFFETESHSVS 
QAGVRWCDLG 


11770 


25671 


A 


11875 


146 


1 


GNHL S S RVG VQDQ P RQHGKT LSLQKIQK 
VAGPGGAHLQSQEVQVGGSL 


11771 


25672 


A 


11876 


2 


159 


S LQF FFFFFRWS FVLVAQAGVQWHDLGS 
PQPPPPGFKHSPASASQVAGRGGR 


11772 


25673 


A 


11877 


124 


240 


FRAFSSGQAWWLTPVIPALWEAETGGSL 
EARSSRPDWRT 


11773 


25674 


A 


11878 


135 


5 


QVLFYFYFSDSFTLSPRLECSGVTLPHC 
NLCLPGSSDSCASAS 


11774 


25675 


A 


11879 


170 


2 


GFFFPEKWGYKNSFLCFFFFFFFEPEFC 
S VAQAGVQWLHLGS LQPPPPVFKVDAAP 


11775 


25676 


A 


11880 


97 


267 


GHGHATLRGLCVLSLFFHI PAPS VSGTS 
DAEECCLCVIYRLICGYISRNCL.YFSDH 
R 


11776 


25677 


A 


11881 


1 


292 


LPEPLRVLWTAHLQAMAPRSRTTLLLAF 
AlaLCIiPWLQEAGADQTVPLSRLFDHAML 
HAHRAHQLiAIDTYQEFEQTYIPEDQKYS 
FLHDSQTYFCFSD 


11777 


25678 


A 


11882 


93 


2 


KYQMGWAWWLTPVI LALWEAEAGRSPET 
SC 


11778 


25679 


A 


11883 


145 


2 


PPLGLRLQVQAPTPGFFFFFFETESRSP 
RLKCKGAILAHCNVCLLLV 


11779 


25680 ■ 


A 


11884 


202 


1 


TWWRWGVTVIiVRLVMNCRPCDRHKSASQ 
LIGRVRQENGLNARVGGCSEPRSRHCTP 
VWVTSENPSSC 


11780 


25681 


A 


11885 


42 


155 


GERSGLS PGVQDEPGQHSGTSSLQKILK 
LAGHGGTCP 


11781 


25682 


A 


11886 


215 


1 


S TMARHC P L S PML FNS AME VL VRA I S QE 
KE I QG I Q I GKEE VQL S L FTDDM I FNLE K 
RKDCSKNLLQLMNLV 


11782 


25683 


A 


11887 


214 


1 


GYFFGLNEVLGKLEKPSLKVPQNPSPKR 
PFFLGFFFFETESRIVARLECSGAISAH 
FNLCLPGSSDSPVSC 


11783 


25684 


A 


11888 


215 


3 


WGPGFPFFSFFFFFLRQGLAVIRIiKCSG 
TITAHGSLNFLGSSDPPASVSLVATTTG 
HEPSITQFHSHGSC 


11784 


25685 


A 


11889 


118 


1 


RFFIMGENPTKFFFFFETESLLPRLECS 
GVISAHCNLSC 


11785 


25686 


A 


11890 


267 


83 


HCLRSGVQDQPGQHGKNPSIQKIQKIiAT 
S FKKSLKI VI PPLPFKNDKTDSKS S CSS 
LSPHS 


11786 


25687 


A 


11891 


134 


2 


DRLAVLPRLECSGMIFLLPLPSRFKRFS 
CIiSLPSSWDYRGAPRA 


11787 


25688 


A 


11892 


116 


1 


S KGVGHFL F FFFFFETES RS VAKTGVQ W 
CDLGSLLCLV 


11788 


25689 


A 


11893 


397 


475 


RFVCST I KVLRDLS SDRSNPGRFLST 


11789 


25690 


A 


11894 


115 


331 


KNVCLFVFVEMKNHLNPGDEGCSKPRSR 
HCTPSWAAQQDS ISKYI CIYRYLYLDIY 
LSKLWRLRINQLPNS 


11790 


25691 


A 


11895 


134 


251 


INPPVSRKKKKKKKKKKKKKKKKKKKKK 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine 3 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKGGGL 


11791 


25692 


A 


11896 


186 


402 


PGKNTLTLENVPRENKVGEKAPGQNEPPV - 

LGGGEYQDPGGNVKGVGPPPAPGFGKDG 

PKRLADTFDLIEGDGD 


11792 


25693 


A 


11897 


100 


2 


KGPPFFFFFFFFFFFFFFFGQSGQVKLK 
SPKCK 


11793 


25694 


A 


11898 


92 


3 


SVARIMPYALFLLSVGLVMGFVGFSSKP 
S 


11794 


25695 


A 


11899 


283 


362 


MTYTMTIVHALWASVCLLLNHAHDPLD 


11795 


25696 


A 


11900 


248 


1 


VM S AQPGL S V I L RFMG KW SNWTGMQ Y TE 
SEVERYRDRGRARETERQRDRERERENP 
RNRKLFFYGRFKHSPQDFFMPQKVHF 


11796 


25697 


A 


11901 


253 


2 


KLQENPFKLLNF I INLCSVSLNVP I ILN 
INIKPSF I APKPRC P S KFQRNY IYIYPS 
IYLSIYLS JYLSIYLPITISKVIiKDVE 


11797 


25698 


A 


11902 


179 


3 


LAVTKNE IMGFSTTWMEMGDINFFFFFF 
ETKSHPVMRLE CSGVI SAHCNLR I LVS S 
SC 


11798 


25699 


A 


11903 


1 


403 


GTSSQESFGGCCVSGLIAMGTKAQGERK 
LLCLFILAILLCSLALGSVTVHSSEPEV 
IIPENNPVKLICAFYDLSFFSSSFSYFF 
HLSFFI YYP YYLFVS CFTFYIVI LLS F I 
YSHHLSSVSYYSYVLLIYTFIL 


11799 


25700 


A 


11904 


37 


242 


KGPTRDQLQHPKARLPAPLRVLWTAHIaA 
AMAAGSRTSLLIiAFALLCLPWIilYDGAS 
HIVSIiFMLFIIF 


11800 


25701 


A 


11905 


1 


298 


GT S QHRGRKD S RTGSH S S S DHPGAiCLL S 
TEEKQAAETMRPPSAPPRRGCIPWQGKL 
LSSS 


11801 


25702 


A 


11906 


225 


30 


ACYLQKDGAFAILPDVLPDMFKGHSSLY 
PCQHAE CDH I KW I YNC VC VCVCVCV C VC 
IVICKLNV 


11802 


25703 


A 


11907 


196 


320 


HSSSS TTPSQDTHTHTHTHTHTHTHTHT 
HKIPQRELLPSVPD 


11803 


25704 


A 


11908 


130 


3 


NGFVFS PFP FFFFFEME SHS VTQARVQW 
CDLGSLQSLPPGFK 


11804 


25705 


A 


11909 


381 


2 


AS I CGQKLI FFQHWKKI ILVQQVGLQKK 
NSTLKRAWEFLSFFQPAPPLGSPLKGIN 
ILNKGAKKLKGKTNFPGPIVFFFSVLPP 
FFGLKKFQKKVFHQGKSPFFFFFFFETE 
SRSVAQAGVQWHDL 


11805 


25706 


A 


11910 


138 


2 


SVPLKEFI ISQARWLTP I I PALWEAEAG 
RSPEVRSSRPASTWRNP 


11806 


25707 


A 


11911 


231 


54 


IRAS FGIQIRI CLTIFFFFETEPHS VTQ 
AGVQWRDLGSLQPPPRRQEQNSVSKQTK 
TN 


11807 


25708 


A 


11912 


3 


461 


DAWGRVEGPPLRP PATSRRWAG P TL WRM 
E VTGDAG VP E S GE I RTLKP CLLRRN YS R 
EQHGVAVS CLEDLRS KACD I LAI DKS LT 
PVTLVLAEDGTIVDDDDYFLCLPSNTKF 
VAMAINE KWAYNNS NGGTGW I SQESFDV 
FEAYSGATLLFFF 


11808 


25709 


A 


11913 


318 


407 


LCGRLWWLMPVI PALWEAEAGGLLKLRS 
MR 


11809 


25710 


A 


11914 


340 


5 


DRVAKFSEARLFRLFFPFFPLKIFCFPR 
GFKIFRGVCPLLFPPEFWGLFQKGPGMR 
LFFPPLGGFFFFFFFFFFFFFFSFFFFF 
FFPFFFFFFFFFCHKNKTLLKKVHNS I 



1261 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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Predict- 
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nucle- 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine 7 
G=GIycine ? H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11810 


25711 


A 


11915 


274 


385 


IKPQRWGETRAEKLKSLKIRVPLLLQRN 
AAPHQQWNK 


11811 


25712 


A 


11916 


259 


441 


KKKKKKKKSSSLRG 


11812 


25713 


A 


11917 


254 


402 


LI VS VIDFLRWRLPLLP KLE CSGMI SAH 
CRLRLPGSYHSPASGCQLLiGR 


11813 


25714 


A 


11918 


223 


1 


NTNS PRKKFF YE E VS SVPVF FGVFP P S P 
LKGSPRAFLKLAWRPPLPLFFFLEMESR 
SAAQPEVQWCDLGSLQPP 


11814 


25715 


A 


11919 


97 


3 


KPFFFP FFFFFFE I ESRS VTQAGVQWRD 
IiGS 


11815 


25716 


A 


11920 


154 


384 


KEFFLMLFFFLPPPPPFFFFFFLGKGLF 
FFPPGGGGGGQFFSIGPPPPRGKKDPPP 
SPPKMGGKRGTPPPPGYFFFF 


11816 


25717 


A 


11921 


225 


369 


GTLNLLTYKIKSWSGAVAHYNPSTLGGW 
GKWITSGQEFETSLANWKP 


11817 


25718 


A 


11922 


116 


372 


MEYTWNKMAMSYNLiHIYQTYKNSVIEHP 
TGVLPHSRVIMVNNIVLCISKQLKELLR 
RLMWE EHLS PGGGGCSE PRS RHCTPAWA 
T 


11818 


25719 


A 


11923 


271 


409 


KKKKKKKKEKKKKKKKKKKKTGGGA 


11819 


25720 


A 


11924 


121 


228 


KKKKKKKKKKKKKKKKKKKKKKKKKKKR 
GGGFKKKPWGGQK 


11820 


25721 


A 


11925 


55 


423 


NKPKKKNFLKKKKKNFFFPFYPLKFFFF 
PKSLNFFRRWPKISPPKKKFFFKNSPS 
VFFFPPLKKKNFFFLTPLKFGPPKNFFK 
RPPPLFFFFFFFFFFFLSGNCIEKL 


11821 


25722 


A 


11926 


253 


443 


YQHQRPKVDKSMKMGRNQRKKSENSKNQ 
NTS S PPKDHNS S PARQQNWMENE FDGLT 
EVGFRRW 


11822 


25723 


A 


11928 


175 


413 


KKKKKKKKKKKKKKKKKKKKKKKKAGGG 
A 


11823 


25724 


A 


11929 


490 


182 


RKQKI KGCKKPNPLAQGGVKKKGKKGPP 
FNFFLKQKGQDFLKNWNLGQKKKIPIPP 
QVFSLRQKGAFPGRIFFFFFFFETESCS 
VAQAGVQTLjSYKNKNKIK 


11824 


25725 


A 


11930 


110 


2 


KKIPRKPPFFFFFFFFYKAGSHIjVAQAG 
VQWEDLiGS 


11825 


25726 


A 


11931 


169 


987 


YLEKIMSEHSRNSDQEELLDEE INEDE I 
LANLSAEELKELQSEMEVMAPDPSLPVG 
MIQKDQTDKPPTGNFNHKSLVDYMYWEK 
ASRRMLE E ERVP VT F VKS E E KTQE EHE E 
I EKRNKNMAQ YLKEKLNNE I VANKRESK 
GSSNI QETDEEDEEEEDDDDDDEGEDDG 
EESEETMREEEGKAKEQIRNCENNCQQV 
TDKAFKEQRDRPEAQEQSEKKISKLDPK 
KLALDTSFLKVSTRPSGNQTDLDGSLRR 
VRKND PDMKELNLNN I EN I RK 


11826 


25727 


A 


11932 


161 


389 


SVQTHPNLRSCSVLKNAMHFYLLGYVIS 
GCTEPAKAI KP IDRKSVHQ I CSGPWLS 
LS TAVKKI VGNSLDAGATN I 


11827 


25728 


A 


11933 


134 


289 


ASTQNMGQMLRLMSSPTLLSLSLSHTHI 
HKHTRMHTHTHTHTHTYTPTKKV 


11828 


25729 


A 


11934 


128 


1 


IKKGEFFFFFFFFETESHSPRLQCSGTI 
LAHCNLHLLGS SNS 


11829 


25730 


A 


11935 


131 


19 


MYIGWVQWLTPVI PALWEPKVGASLEAR 
SLRPARTSE 


11830 


25731 


A 


11936 


104 


3 


LKGQLIRAQWLMPVIPSLWEAEMGRSPE 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VGSSR 


11831 


25732 


A 


11937 


146 


2 


GGRLGI I YHAWNPEGFFFF FEME S CS VG 
QAGVQWCNLSSLQPPHPGFK 


11832 


25733 


A 


11938 


149 


1 


DKCFLC VCCLF I FI YLLI YFYFFETES C 
S I AQAGVQCCNLGS PQPLPLA 


11833 


25734 


A 


11940 


176 


380 


QSVAQLPRLECNGMILAHCSLHLLGFKR 
FSCLSLPSSWDYSRLRNYQGGRRWSKTH 
QAWCQSLTIIiWN 


11834 


25735 


A 


11941 


133 


415 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKSG 


11835 


25736 


A 


11942 


222 


420 


QGDKFLDNNNSTILYMEKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKYSR 


11836 


25737 


A 


11943 


163 


425 


KNPPKNSQLSPSTLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKTG 


11837 


25738 


A 


11944 


143 


3 


KMGKKQ S R KTGNS KKQ S AS P P P KE RS S S 
PAMEQSWTENDFDELiREE 


11838 


25739 


A 


11945 


200 


3 


QNSQVGYKGAPHKEKTMALQARVNLGPP 
RGPLKRPALFFFFFETESPSVTQDGVQW 
HDLGSMQPP 


11839 


25740 


A 


11946 


221 


3 


RKIEGGPSTSRKSVRGVRCFQPLGLGFP 
GGLFFFFFFYETESCSVARLECSGWSA 
HGILHLRS TNEGRRGR 


13840 


25741 


A 


11947 


68 


177 


ISSSFKNKChCEKKKKKKKKKKKKKKKK 
KKKKKRKR 


11841 


25742 


A 


11948 


210 


329 


RHTHTHTHTHTH ILSLS LSHTHTHAHTL 
MYFLAF FDLRS 


11842 


25743 


A 


11949 


183 


20 


LGTVAHSCNFVFSLSRGFLHVGKNGLRL 
PTSDDPPTSASQSAGTTGMSPCTWPE 


11843 


25744 


A 


11950 


232 


329 


VG I KMS I S SDEVNFLVYRYLQESGFSHS 
AFTF 


11844 


25745 


A 


11951 


412 


239 


LFYEKGSRFVSQAGLELLELKQSPCFGL 
PKCWDYRHEPPRQADLPFCVPSSAWASW 
L 


11845 


25746 


A 


11952 


310 


393 


PHTDISGTPEI MH YVHVHR VTTQ PRNKP 


11846 


25747 


A 


11953 


193 


3 


LSSWPTLISGAFKAEENAVEGNDDSRIL 
RGFFVLFCFVLFFRQSLALSPRLECSGV 
ISAHCNL 


11847 


25748 


A 


11954 


185 


416 


S VQTH PNLR S CS VL KNAMHF YLLGT E PA 
KAI KP I DRKSVHQ I CSGP WLS LS TAVK 
K I VGNS LDAGATN I D L KLKD Y 


11848 


25749 


A 


11955 


100 


1 


AHLSKVFPIFFFFFEMESHSVTRIiECSG 
TISAH 


11849 


25750 


A 


11956 


35 


521 


KEKFFFHAGVYWGPPRNFLKRAPLFFFF 
FFFFFFFFFFSFFFS 


11850 


25751 


A 


11957 


214 


1 


KKKIFFFKEIFYPKLSRKKKNFFRGAGK 
ILNLSIFPEKKKIFSPFSFFFETESCTV 
AQAGVQWHVLGSLQP 


11851 


25752 


A 


11958 


114 


1 


RYFSGQAQWLTPVI PALWEAEAGRSPEV 
SSSRPAWLTW 


11852 


25753 


A 


11959 


188 


12 


S KCQ CTF FSNFKKTEME SHYLAQAGLEL 
LGF SHL P T S VS QT VG I TGVSHCAQ PNAH 

LY 


11853 


25754 


A 


11960 


241 


556 


SSIPFLPNKHLLLSFSTLSSLGREVSIQ 
DMCQGTYQKQPFPFPLPHVFHLS IRDGE 
LCVGRDVHLTCQTDLAQVFVCLFVLGVL 
LSLPRAGVQWCHLSSLQPPPP 


11854 


25755 


A 


11962 


362 


462 


KIGQALTPIIPLLWEAEAGASPEARSSR 



1263 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S= Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /—possible 
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SAWPT 


11855 


25756 


A 


11963 


43 


2 


CCCYCCCCCCYCC 


11856 


25757 


A 


11964 


30 


415 


GLGFKQKIFFFFFLGQGPNFDLLRAKKR 
PFLPPPFFPKNPKISrRGVFLLGPSGVPKP 
RPRQGDVPPLKPRSLGVFLTSKKTPLGP 
WANPPPPPGDGP 


11857 


25758 


A 


11965 


149 


2 


SKTWLKLAKYLKNTYPGWVRWL IPIVPA 
LWQAGAGELHEPRNLRPAWAT 


11858 


25759 


A 


11966 


179 


3 


KKNI FFPPP VKFGP PQGFFKRP P PLFF F 
FFFFFFFETESRSVAQAGVQWLS IGSLQ 
AP 


11859 


25760 


A 


11967 


245 


382 


DVTDS KMHPDFRS S HVKMWLGWAHACN 
P S TLGGSRGQE FKTS LAN 


11860 


25761 


A 


11968 


264 


10 


LSTLE CKGVS PGVF S PKGWRKGLGYI F S 
P FLE KPH I P PLG I S M YFFF FFDGVS VAQ 
AG VQ GRDLGS LHL P S PGS SYS CAS AS LS 
S 


11861 


25762 


A 


11969 


326 


406 


RLLKGWALWDTPVIPTLWEATVGRSPE 


11862 


25763 


A 


11970 


120 


1 


KGYGFPFPPKKPLGPPFFFFFATESCSV 
AQDGVQWRDLGS 


11863 


25764 


A 


11971 


84 


370 


RNGAELKTPLAGQTQITINCRYSWVTLV 
LGDLES I LCWHI FKKTNLF I CGFFSSFL 
LGAGRMKSHCWRLECRGMISAHRNVCL 
LGSNDSPCSAF 


11864 


25765 


A 


11972 


196 


3 


SRGKMGEYPFVLFKNPSWAKAGGNKGNP 
F FFFFEAE S CS VAQAAVQW CNLGS LQAP 
PPRFTPRA 


11865 


25766 


A 


11973 


559 


644 


KIGCSAYGVAILLFIiYFFNKLAFTLWKK 


11866 


25767 


A 


11974 


3 


391 


HEAQLPEPLMVLGTGHLAEMAGGSRTYL 
LLAFGLLCLPWLQEAGAAQTVPLSRLFD 
HAMLQAHRAHQLAI DT YQELEE T Y IPKD 
QKHSFLHDSQTSFCLSDSIPTPSNMEET 
QQKSNLELLRISLLLIE 


11867 


25768 


A 


11975 


138 


365 


LKFECHSTLCANHGKQTFTFVFYNKFLL 
KYTFYFS VLT YAINWYI KNL INVFKKKK 
KKKKKKKKKKKKNFKKKKN 


11868 


25769 


A 


11976 


275 


433 


MS ILDLS KARNFFLS FLETGS CS I TQAG 
VQWLNHSSLQPQTPGLRDPPASASQ 


11869 


25770 


A 


11977 


108 


3 


RCGLFFFETRASPCRPGWHNLGSLQPLP 
PRFKRF 


11870 


25771 


A 


11978 


270 


408 


FENNVWLGRAQWLTPVI PALWEAQAGRS 
PGPENQNHPDYKGNPWPY 


11871 


25772 


A 


11979 


228 


441 


QALKFVI EMLLSLKKCLDVSVI FNRHKK 
IELLQKKKKKKKKKKKKKKKKPSQKKKD 
SS RGKDS 


11872 


25773 


A 


11980 


2 


447 


GAIoALNKTTADVWRLNFLVSGLHWKRWI. 
QQTSLSKWVKIKECS ILKKKKKKKKKKK 
KKKKKKKKDKK 


11873 


25774 


A 


11981 


124 


445 


KYGEMSQNPARGGPKFSKNQKYSEHLRI 
HCCPPFTFLISKKE IGDRKYS I CKSGCF 
YQKKEEDWFCCPCQKTKTSRRAKSLKRP 
KQKPVAP PGGVKAPAKPRSLPRF 


11874 


25775 


A 


11982 


48 


429 


KESNGSQDRLLPKIHSPDVSGGKSGGMS 
QNP ARGG P KIiS KNQ KS S EHLR I HC C P PL 
P FL ISKKE I GDRKYS FGKSGCFSQKKKE 
DWICCPCQKTKLKGKIRPPPKKNGPGGS 
LNGRTT WVS GLFVHN 
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11875 


25776 


A 


11983 


419 


500 


SLLVKVEKQWPGAVAHACNPSTLGGPG 


11876 


25777 


A 


11984 


362 


496 


LWSPIRI LLLRWAWWHVPT VPAI WE AS V 
GRLLEPRSLCNMVRPIS 


11877 


25778 


A 


11985 


142 


433 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGGVI KKFLGGP I FGGGG 
KEKFFFFGGGFIN 


11878 


25779 


A 


U986 


177 


2 


CQRRWVTLSHHTHS ITRPLGQLKKGWS P 
GAVAHT YNP S FLGGQGG WVT YVHRTRGR 
TR 


11879 


25780 


A 


11987 


178 


443 


DKKKKKKKKKKKKKKKKKKKKKAGGGLiF 
KKI 


11880 


25781 


A 


11988 


329 


410 


LIFFFIFFERESHSVAQAIVQWHDLGS 


11881 


25782 


A 


11989 


146 


1 


DRASALQPGRQGRNFVSmKivJNNVI YI I 
NTLTTTIFLGTTYDALSPEL 


11882 


25783 


A 


11990 


240 


420 


GVFAPLLGDVSQSGYTGFRDPLEEAVCL 
FLELECHAERTTALFRAVRQGCLSLQNL 
SVAF 


11883 


25784 


A 


11991 


239 


410 


GTLFIYLLGDLFIYFRNKSLTLLPRLEC 
GGVIVAHCSLDLLGSGYPSISALPSSWD 
C 


11884 


25785 


A 


11992 


126 


3 


YTGEYKSFCHKDTCTHMFTAALFTIAKT 
WNKPNGRVGGRV 


11885 


25786 


A 


11993 


300 


22 


SQLLGRLRQENHFNWGGRGCSEPRSCHC 
IPAWATRANS 1 FCGFQASSVEVRRSARK 
KLFSDILKRHNTISWRVSGLLIiVDSYFG 
RLATPVRAQ 


11886 


25787 


A 


11994 


303 


2 


EGEEIFLPSPPFPPGQENPGSNCSPPLF 
P FGTFLD I RVAGSHKVQAKGVFKKRP PS 
LHLFLIKKRFFFFFFETESRSVPQAGVQ 
WRDLGSLQTPPPGFKR 


11887 


25788 


A 


11995 


171 


457 


SFSDHLILGWIGFCNNLSKIALCSSPIK 
NKNDDLQKKKKKKKKKKKKKKKKAKASS 
SYQDSS 


11888 


25789 


A 


11996 


357 


204 


EKTGFPHVGQAGVDLLTSRSTRLSLPKC 
WDYRHE PMRLAGHIiYYF YPQMK 


11889 


25790 


A 


11997 


219 


I 


PRVFWAPPPRYPPGALFWAPRPVGVSLG 
ARAPTKAGPKQKGARGTFFFFFEMESHL 
S P RLE CNGT I WAHCNL C 


11890 


25791 


A 


11998 


254 


1 


AHLRGNRQLPKHTFFQYMTTNLKCAFSV 
GRQSYSIPWRSFYTSLFFKYSCVFHTHT 
HTHTHTHTHTLYFQIMVLLLPSLRKKGS 


US91 


25792 


A 


11999 


167 


2 


NFKS F FQGL S RG VLNPKVNL FFFLE TE S 
CSVSQAGMQWGDLDSLQPPPTRPPTRP 


11892 


25793 


A 


12000 


104 


3 


RPWTFFFFSETKSRSVAQAGVQWRDLCS 
LQPPP 


11893 


25794 


A 


12001 


178 


2 


KI FQRGAKNS P WGLPFRGGENKKGAP PC 
QKGFGRFFFFFETESRFITQTGVQWCDL 
GP 


11894 


25795 


A 


12002 


163 


282 


GIGGE WCLSKYVI KYVKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKK 


11895 


25796 


A 


12003 


195 


2 


KWGRIPLKKGGFCFTKKKSGQTIjLjIKSP 
PPAFFVFFIFFFLETESRSVAQATVQWG 
DLGSLQAP 


11896 


25797 


A 


12004 


152 


1 


GFVDVSAEKLRPKDFMTKTPKAMATKAK 
I DKWALTKLKS FCTAKET I IRV 


11897 


25798 


A 


12005 


359 


1 


EWPIWPPPKRRVLSKVSKQFFISAPIR 
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KKP F P CQ VGL I LAP FRVLL I GRP FF F F F 
GVLWE I QRWS F I S LWVWRARIMMYA 
L FLLS VGLVMGFVEFS S KHCPVYGALVL 
VSIGGRV 


11898 


25799 


A 


12006 


189 


12 


DGRLPGSVCYYLIFLRQSFTLLAQAGAQ 
WCHLG S LQ S P P PG FKRGRRVS R I RAVRG 
DQ 


11899 


25800 


A 


12007 


131 


3 


KI FFFFFFFEMESHSVAQAGVQWRDLGS 
MQDPPPGFMPLLFY 


11900 


25801 


A 


12008 


150 


2 


IILFLVPIEAVFFFFETVAEAGVQWHNL 
SSLQPPPPGFKQSPASASRVA 


11901 


25802 


A 


12009 


305 


3 


KFFFSLKASIRGRWFGLFFTPPKKGFLP 
KI PHQVFKGGPLWEKLQLGKAGLNLGPY 
KGFFKGAARFFFFFFETESRSVAQAGVQ 
WRDLAHASVHASAHAS 


11902 


25803 


A 


12010 


263 


517 


DKGFQLPPRWCPGQDPLFFYGGIVFHGG 
YGPPFFFFFFKAESRLVAQAGVQWQDLG 
SLQPPPCRTRGSP 


11903 


25804 


A 


12011 


107 


374 


WMGTWGSLPVLLTTRSTCPPRSPRLHRR 
TGATIWIFFRLFETGTVLPMLECSTVIM 
AHCRLQLPGS S S S PTIAS QVAGTKAHLS 
DCFVY 


11904 


25805 


A 


12012 


144 


261 


FPKKINNFYFTFLQDPTGI FSLDKTIGL 
GTYGRIYLVS 


11905 


25806 


A 


12013 


371 


3 


IFHLRKIFTFLPGIFWPIRLLSEKKKPK 
RFWVKI KKFKNGYPLKNP I NPKYKGPRF 
FFKGEKNPNGKVPVLKKIKIPSFSKALG 
FLQKKASKPFFFFFFDTESCTVDQSGVQ 
WCDLGSLQAP 


11906 


25807 


A 


12014 


101 


3 


RGAFFFFFFETRSRSVIQAGVQWCDLGS 
LQAP 


11907 


25808 


A 


12015 


205 


2 


VFNPSGINVMYVDAI ILNKRIiATRIQQH 
I KQR I LHDQVGF IHGMQG WFS IRKSINV 
IQHINRPKDKN 


11908 


25809 


A 


12016 


168 


1 


GCVCQFQKKLGNGGLNGFFFPLFFFFFF 
EMKSRSWQAGVHWRYLSSLQTPPPEFK 


11909 


25810 


A 


12017 


311 


2 


RGLRFGQWKTQMENPNTP FSALRPGQLS 
SIRSGLLHTYPGLGDRSEPLSCS ILSSS 
KYWWQAVTSALS SNKPGASQGHWKDDF 
FLFFETESCSVTQAGVQWH 


11910 


25811 


A 


12018 


383 


247 


LVEMGFRHVGQDGLDLLTSQSAHLGLPK 
WWDYRREPPRLATIEVL 


11911 


25812 


A 


12019 


2 


379 


RVLWTAHLAAmAGSRTSLLLAFALLCL 
PWLQEAGAVQTVPLSRLFKEAMLQAHRA 
HQLAI DTYQE PIS SWGMEAY I TKEQKYS 
FLHDSQTSFCFSDSIPTSSNMEETQQKS 
NLELLHISLLLIES 


11912 


25813 


A 


12020 


3 


389 


PEPLRVLWTAHRGAKGAGSRTSLLLGFA 
LLCLPWLQEAGAVQTVPLSKLFDHAMLQ 
AHRAHQLAIDTYQE FEETYI PEDQKYS F . 
LHDSQTSFCFSDSIPTPSNMEETQQKTN 
LELLRI FLVLIELWLDP 


11913 


25814 


A 


12021 


247 


386 


PMLGHVSQSGGNGVRDPLEEAVCPLAKL 
KHCSGRSTALFRAGRQKR 


11914 


25815 


A 


12022 


17 


371 


PLRDLWTAHLEAMAPS FRTDLLLAYALL 
CLP WLQE ARAVQT VS LYRLS DHAMLQAH 
RAHQLAIDTYHE I EETYILKDQKYSFLH 
DSQTS FCFSDS I PTPSNKEETQQKFYLE 
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LLPISL 


11915 


25316 


A 


12023 

■ 


3 


359 


LRALWTADLGAKAPGSRTFLLLASALLC 
LPWLEEAGAGQTVPLSKLFDHA1LQAHR 
AHQLAIDTYQELEETYIPKDQKHSFLDD 
SQTSFCFLDS I PTPSNMEETRQKSNLEL 
FRTSLLL 


11916 


25817 


A 


12024 


2 


363 


PEP LKGL WTDHLGAIiAP S SRTS LLLAYA 
LLCLPWLQEAGAGQTVPVYTLFDHAMLQ 
AHRAHQLAIDTYHELDETYMPEDHKYSF 
LHDSQTCFCFSDS ITTPYNMEETQHTSN 
VELLRISL 


11917 


25818 


A 


12025 


314 


393 


GQAWWLT PVI PAFWEAKGGRS LEARS 


11918 


25819 


A 


12026 


80 


392 


PLIACSFFLFLFFFFGKKILFLPPRWKG 
GGGIWVNGNPGFRGQAFFLAS S S KKPGM 
GGPPLTPGKRFVFLKKKGFSPGGPGGSK 
FSAPGTPPTGPPKGWEIPAK 


11919 


25820 


A 


12027 


165 


2 


WQKLLFHFGTESCSVARVGVQWRHFSSP 
KP P P P E F KQ L SAP AS RVADRR P PDAW 


11920 


25821 


A 


12028 


1 


338 


VFVALKFLMCLLSVCIFSLENIYSNSLP 
QKNCLPLIFFFLFFEMEACSVTQAGVQW 
GDLGS LQ P PVSHLNLGGGGCCE PRE CHC 
TPCSRPGDRDFVFNKNTMIQDNHLMELT 


11921 


25822 


A 


12029 


2 


315 


HEERERERERERERERERERERVGRTVG 
G VGET Y KAE L PRVGGAAQ KRAH FS ARGR 
LFMEICGDMCGEKPGKPPHSLTEECIiSR 
CGERFFDTSLAITRGCAQSV 


11922 


25823 


A 


12030 


216 


3 


ERIIPRGVRQRNRLFSGRDTASPPLFYP 
PPRQIHKRGVEDREHERRGERERERERE 
RERERERERARAAR 


11923 


25824 


A 


12031 


114 


329 


QTERNS ININKKDIHTKTPSVGHQHQRP 
KVDKTTKMGRNQSRKAENSKNWKAS S P P 
KEHNS S PAREQNWMES 


11924 


25825 


A 


12032 


177 


1 


IKKKTFFWQNIPLFSKKKTCRQKFFFFF 
FFETGSIiLPRL,VCSGAVLAHCSLCIiPGS 
AFL 


11925 


25826 


A 


12033 


127 


1 


PSFFFFFFFNETESRSVAQAGVQWRDLD 
SLQAPPPGFITPSC 


11926 


25827 


A 


12034 


47 


314 


GAPVASVSISCPSCSATDGWRNGKSTA 
GHQRYLCSHCRKTWQLQFTYTASQPGTH 
QKI IDMAMNGVGCRATARIMGVGLNTXL 
RHLKN 


11927 


25828 


A 


12035 


15 


408 


GAI PGAMGHFTEEDKAT I TS LWGMVNAE 
DAGGETliERLLVDYPWTQRFFDSFGNLS 
S AS A I MGN P KVMAHGKKVL T S LGDATKH 
LDDLKGTFAQLSELHCDKLHVDPENFKL 
LGNVLVTGMAIHLGKEFTP 


11928 


25829 


A 


12036 


122 


2 


KLVDVLPPPGGARVFFFFFFPLETEHCS 
VAKAGVQWHDFG 


11929 


25830 


A 


12037 


154 


1 


GKPPPKLGFWGGPRQRVRGRGPPFFFFF 
ETESPSVAKAGVQWCDLGSLHPL 


11930 


25831 


A 


12038 


58 


254 


DPRVRQQRTTAHCSLNLPGLQRPPSLSL 
PS GWD YRHAPLCATQKI LAYRVAQRERS 
AHGSYYQAS 


11931 


25832 


A 


12039 


270 


124 


DGVLLLIiPRLECNCAISAHGNLRLPGSS 
NS PASNKTKQNNNKKTLSNKF 


11932 


25833 


A 


12040 


63 


355 


LGRGT AHRRLWRP L S RP ASRVS YL S AAT 
NKR S FAP P S RAF P WDNKWERGGF Y F FE T 
GSHRRRPGRMECSGAITAHCSLDFPGSE 
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MGFTMFPSLVSNS 


11933 


25834 


A 


12041 


184 


3 


GVSRLLGGVSQLGYMGVRDPIiEEAVCLF 
SKLKHHPGRTTALFRAVRQGCLSLQKFL, 
LLFA 


11934 


25835 


A 


12042 


118 


3 


DRVLLiC YLGWE CS GT I SAYCNLHLPGS K 
RFSCLSLPSS 


11935 


25836 


A 


12043 


40 


436 


LSEGKLTNRKDNHTKTPSVHHHHQKPKV 
DKTTRMGRIQSTKAENSKNESTSSPPKE 
HSTS PATEQSRMENDFDELREDFRRS VI 
TNFSELKEDVRTLCKEAKNLEKRLDEWR 
TRINSMEKTINDLMELKTMA 


11936 


25837 


A 


'12044 


101 


2 


HLGQVRWLTPVIPILWEAEVGGSPEHRS 
SRPAW 


11937 


25838 


A 


12045 


123 


3 


LTVIKELPLGQARWLTPVIPALWEVKVG 
GSPEVRSSRPA 


11938 


25839 


A 


12046 


3 


432 


PIFGGGGERIFPFFFFFSQGGGVFFPKT 
KNKGFFPFLGFFKKIFLRNFFFIiFFPFF 
FIiNPFFYFGAPPFFFFFPGGGFFFPLFS 
RFFKFFFQNFPQGFFFGGGFFFFFLGFF 
FFFFFL 


11939 


25840 


A 


12047 


325 


164 


KNKRKNRGQARWLTPVI P ALWEAEAGGS 
PEVRTTGVSHGTRPHLLLNTVSEYI 


11940 


25841 


A 


12048 


187 


381 


IiMDKRVSIiWGDENILKVESNSGCTTLYjb 
LVCFEMESRSITQAGVQWRDLSSLQPPP 
SRFKRFSCL 


11941 


25842 


A 


12049 


84 


464 


S YRVP S H PDTL VL SRI SAQE AGE KS P F C 
FPERVWPCPRPLSDLGRRLKLECGPDLL 
DSTFLSFFFGEGLKTGSHSGALECSGVI 
RDHCGL CL S G S GD P P I SACLRVAGT TG V 
SHHCIFCRQDLPGS 


11942 


25843 


A 


12050 


232 


20 


LLQCSSRAKIHTSLTGNQKLEMIKLSEE 
GMSKAEISQKLGZjLHHTiGLVANAKQRFL 
KVI R SAT P E FRHAE 


11943 


25844 


A 


12051 


401 


518 


FFETESCSVTQAGVQWRNLSPIiRA 


11944 


25845 


A 


12052 


191 


1 


LFPLKKEKIFFLGPGTNGAPPMFFLKAP 
PLFFFFFFFFFFFFFFFFFFFFVNLFLL 
LSFLHILEP 


11945 


25846 


A 


12053 


12 


369 


PLPIYSACIEVGSNPQGPGIDAKSVSHN 
NCYLKEKKKKKSNGSQNNLLPKIHTPNG 
SGKKTGEKIQNPPRGGPKLSKNQKYSKN 
LKKHCCPRLTFL I S KREKGNRKYTFGKS 
GCFYQKK 


11946 


25847 


A 


12054 


89 


513 


NFTVRGS I PtiLNNI PMANLLLLI VPIL I 
AMAFLMIiTERKILGYIQLRKGPNWGPY 
GLLQPFADAIPLFPKEPLTPAPSAFALC 
FAAPALALPFALFLCPPRPLGRP 


11947 


25848 


A 


12055 


158 


3 


SQLTWPIMAQTDSMCLWIVYFDDNIGWA 
WWLMS VI PTLWEAKVGGSLEHSN 


11948 


25849 


A 


12056 


113 


1 


KSHGGHGVEVLVSVLVGLAMEVGIiVLWV 
KEYDGAALA 


11949 


25850 


A 


12057 


44 


263 


ARIARSAHEGKMPRKYTGARKNAENRPQ 
RELQLKASRSTIDLAEHACNALWYQLFL 
ISVGSWKNYILFYLTYT 


11950 


25851 


A 


12058 


142 


1 


KKIFFRYKKEGFSFPKFFRISLFFFFFF 
ETESRSVAQAGVQWRDLGS 


11951 


25852 


A 


12059 


12 


347 


QTERNS ININKKDIHTETPSEGHQHQRP 
I VDKS T KMRKNQ CKKAENS KNQS AS S PA 
KDHNCLPAKEQNWTENEFHKLTEVGFRI 
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G=Glycine, H=Histidine, I=Iso leucine, 
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nucleotide deletion, \=possible 
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ITNSVELKEHILTQCKEAKNLEQRLEEL 


11952 


25853 


A 


12060 


245 


3 


MPEGSFFLEIPEITRVFVKKPGKFLGGV 
GPSLLFFFPFFFFFFMRQSIaAVSPRLEC 
NSAISAHCNLCLLGSSDSRSSGSR 


11953 


25854 


A 


12061 


94 


3 


KGGIFFFFFFFFLRQSPSVAQAGVQWCD 
LG 


11954 


25855 


A 


12062 


108 


5 


NNLS SGVRDQPGQHGETTSLLKI QKLAR 
HGGRHL 


11955 


25856 


A 


12063 


298 


1 


KKKKINTHAQKRGPPFFFKPPPEKGKAP 
P PQRVT KKGGE KDSFIP KRRE KKNP PPL 
SFFFFLVRVHVRERERARARERERERER 
ERERERERERERERAR 


11956 


25857 


A 


12064 


122 


1 


VCEGVCVGGWVRDRERERERERERERER 
ERERERERERAR 


11957 


25858 


A 


12065 


126 


1 


ATCTKFRNDNRALLRVRVFERERERERE 
RERERERERERERESR 


11958 


25859 


A 


12066 


48 


351 


FFFFFFFFKKGPKI VPPGGGGGGDP I FL 
EPPPPPLKKFWGPPLRGRGNKGGGPPGW 
VNFGI FLKKKGLPWGPGGVKTPAFKGS P 
GPTPPKGGNNRKNPPPW 


11959 


25860 


A 


12067 


1 


365 


GTRLTVLWTAHLVAMAPGSRTSLLLDFA 
LLCLPWLQEAGAVQTVPLSRLFDHAMLQ 
AHRAHQLGIDTYQEDEETYIPKDHEDSF 
LHDCQTSFCFSDSIPTPSNMEETRQKYH 
LELLRISLLF 


11960 


25861 


A 


12068 


174 


1 


PAWVMQHNPVSLFFCFFETESRSLTRLE 
CSGT I S AHCNVRLPGS SDS P VS P SRVAA 
RA 


11961 


25862 


A 


12069 


270 


375 


TRI KRCNGGRAQWLTPVI PALWEAKAGE 
SPEVRSS 


11962 


25863 


A 


12070 


197 


350 


KKIYIFLTINTIMDLIIPFLLDNILGFW 
LGAVAHACNP S TLGGRGG W I TRS 


11963 


25864 


A 


12071 


3 


378 


HEGQLPEPLKGLWTAHLLGMAPGSRTSL 
LLAIDLLCLPWLKEAVAVQTVPLSRLYD 
HAMLQAHRAHQLAIDT YQE IEETYI PKD 
QKLS FLHEYQTS FCFSDS I PTPSNMEET 
LHKSHLELLRIYL 


11964 


25865 


A 


12072 


219 


1 


FSHPFPPGVFQTPLVFPRPWEVLGSPGG 
GGGPPKKWAGVQKWVFFFFFFFWEMESC 
S VAQAGVQWYDLGS PRA 


11965 


25866 


A 


12073 


134 


3 


VFG PPFFFFFFCETE S HS VAQAGVQWRD 
LCSLQAPPPGFMPSC 


11966 


25867 


A 


12074 


175 


357 


MPINQPVKKMCVCVCVCVYIYIHTPLYI 
RVYMYMCM YI CVY IHVYIWMC I Y I HTH V 
STYVW 


11967 


25868 


A 


12075 


193 


350 


KIPHLTSLYHTQNYLKYC IGQAWWLTPV 
I PALWEAEAGGSPEVRSSRPAWPW 


11968 


25869 


A 


12076 


252 


1 


GVFGP FPKKGGF FWVKGFLGTP Q I FWVG 
VFFKKKGGALWVKGLLGKPPPPPLFFFF 
FETES CSVARAGVQWRDSVKKKI TIARA 


11969 


25870 


A 


12077 


273 


366 


IFIYLFXYLFETESCSVSQAGVQWPNLG 
RLR 


11970 


25871 


A 


12078 


127 


2 


KKKKTLFFFFFFFFEMESRSVTQAGVQW 
CDLSSLQPPPPIiV 


11971 


25872 


A 


12079 


206 


334 


HNRVTIVNHNLIVHFKITKRCWTQWLTP 
VI PALWE AKAGGS S E 


11972 


25873 


A 


12080 


135 


2 


KMKRKFGEDSQ I P KTLQTFFFLTESRS V 
AQAGMQWCDLGSLHSC 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11973 


25874 


A 


12081 


147 


3 


ATTPGLFFIFYFFETESHS IAQAGVQCM 
I S VHCNLCLPGS SDS P VLV 


11974 


25875 


A 


12082 


106 


2 


YF I P INEGYCF I YLF FETE SRS VVQAGV 
QWCDLV 


11975 


25876 


A 


12083 


164 


346 


CPYKKRKKANQAKWLTPVIMLFGRPRRV 
DRPSSGVQDKPGQHGETPSLPKIQKKSQ 
AWWCA 


11976 


25877 


A 


12084 


186 


1 


KSLFSKGNPLFKGPPPPPFFFFFFFETR 
SRSVTQAGWGAVIMAHCCLNLPGPSDPP 
TPAPRA 


11977 


25878 


A 


12085 


98 


2 


CPFFRVNFFFFEMESRSVAQAGVQWRDL 
GSRA 


11978 


25879 


A 


12086 


154 


2 


SPTPEKGVWELPSPFFSWGPKIWGCIFF 
FLYETESGSVAQAGVQWRDLGS 


11979 


25880 


A 


12087 


326 


3 


KTSVITCGLPMGWSWGPFPKGPWWCKKF 
CVCKFWANQTLKKNLGAGRFQKPFSAAA 
FLEMGGFPFKFFRFFQGSQFFFFFFFET 
ES CS VTQAGGQWCNLGSLRPbPP 


11980 ' 


25881 


A 


12088 


154 


3 


KEFS FFAPGGKQRGE IRSLRGPPPQVKP 
LFFFFFFETESRSVAQAGVQWR 


11981 


25882 


A 


12089 


174 


2 


S WKAI LQYSLE 1 1 LYLPS FQ I FLCFSHT 
HTHTHTHTHTHTLYSQ IMVLLLPSLRKK 
GS 


11982 


25883 


A 


12090 


265 


30 


WFIVYSE I S QLWGLPS FNTLGTWQSLS F 
IFVAEMRSCCVTQAGLELLASSDPPVSA 
SQSARIRAMSPSVAWVIPGRSR 


11983 


25884 


A 


12091 


257 


1 


GSVMRMHTEEQYPSNKVEQSSSGFIRPH 
LVYRVCFYLSVCLSVCLSIYLSIYLSIY 
LS FFLSFFLLRKSLYLS I FLS FFLLS PR 
V 


11984 


25885 


A 


12092 


311 


404 


LYHQNPWLYSVFFFLETESCSVAQAGVQ 
• WCD 


11985 


25886 


A 


12093 


331 


83 


GDDYKGARENSRDDKLFYVLIMMWIQL 
HVFVKIHKTLQLtfGYILLCKLYLINLTK 
NNFQKPKTNSQKTLHIFHPDSTAANI 


11986 


25887 


A 


12094 


82 


2 


LREPNLLNPGGEGCNEARWCHCTPAL 


11987 


25888 


A 


12095 


255 


393 


HGILFSSFVCNLNQFLSQHWLMPVIPAL 
WE AE VGGSLE PRS S TLAW 


11988 


25889 


A 


12096 


80 


3 


PFFFFFFFFETESRSVARLECSGMI 


11989 


25890 


A 


12097 


367 


1 


PRVDKPTKMGKKQGRKTGNSKKQSPSAP 
PKERSSSPATEQSWMENDFAKLREEGFR 
RSNYSEL.QEEIQTKGKEVETFEKNLDEC 
ITRIPNTEKCLKELMELKPKAQELREEC 
RSLRSRCDQL 


11990 


25891 


A 


12098 


94 


236 


MWADYFCMSPS IDEGLKKKKKKKKKKKK 
KKKKKKKKKKKKKKKGGAP 


11991 


25892 


A 


12099 


410 


212 


NSLHPSTSLFRFSSHFHIKLISGQAQWL 
TP I I P VP WE AKAGGLLETRS S RLAWAAQ 
EDPISMPVCK 


1 1992 


25893 


A 


12100 


140 


324 


NLGQAQWLMPVI PARPWEAEVGRSLAVR 
S S SNS PAS AS Q SAG P TGPSHHNWLMFLQ 
LLTRL 


11993 


25894 


A 


12101 


247 


357 


KQIQGRAWWLTPVFS PLWEAKAGES PE V 
RSLRPAWPT 


11994 


25895 


A 


12102 


245 


2 


DVLVGGLCIiPAGGTLHLTS I FLLSGCWC 
GSKDEEAPCKQRISVQRESQSRTLRAGV 
S P KKAHP CEMCGL I LEDVLHFPDLQ 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11995 


25896 


A 


12103 


230 


379 


KEASFFETESHSVSQAGVQWRSLNPLGS 
GNP P T S TS RVAGT TGM SHHVWL 


11996 


25897 


A 


12104 


86 


2 


DGVS VLS PRLECNGL ILAHCNLRLPGS S 


11997 


25898 


A 


12105 


116 


2 


SQKTPFFFFETVSLLLPKLECNGAISAH 
CNLCLPGSSD 


11998 


25899 


A 


12106 


271 


416 


QFTFSKAFNLVFIKPSFHISFYRTHYSQ 
AWWLTP VI PELWEAE AGGS P 


11999 


25900 


A 


12107 


194 


3 


HVLGTVLVAEDTASNKIRSLQGCLLLPL 
LFSWLEVLARAIRQEKEIKVIQIGREE 
VNRRRGR 


12000 


25901 


A 


12108 


142 


3 


LCLLYWDCKRHAWWRGQVRWLTPVIPAL 
WE AE AGGS PEVRSLRPAW 


12001 


25902 


A 


12109 


233 


346 


HQRSLIGWAQWLMPVIPALWEAEAGVSP 
EVRSLKPAWP 


12002 


25903 


A 


12110 


234 


349 


GFSPFYDFCLFVFETESCCWQAGVRWH 
DLGSLQPPPP 


12003 


25904 


A 


12111 


136 


333 


RLNFF YFYFFETAS CS VAQAGVQWHDLA 
HCI PAWETEQDS I S KNTVQKRKKKVTRA 
GISKTKNEMF 


12004 


25905 


A 


12112 


345 


110 


QDGLCLVMTLQETQPILAYSLWFPIVFP 
TTKACNVQGDSKFLLKKKYLGQVQWLMP 
VIPTLWGADVEGSPELRSLEPA 


12005 


25906 


A 


12113 


108 


285 


YNAMKNRFLKTILNNKNSIGWARWLMAV 
I PALWEAKAGRS PE VKS SRS RPGAVAQ V 
DAA 


12006 


25907 


A 


12114 


1 


364 


RWAAEMGKFMKPGKAALDLAGRYSGRK 
AVIVKNIDDGTSDRPYSHALVAGIDRYP 
RKETAAMGKKKIAKRS KIKS FVKVHNYH 
QLMPTRYS VD I PLDKT WNKDVFRDPAL 
KRKARREAK 


12007 


25908 


A 


12115 


120 


3 


TQ I WGAFL TL F F F F F FE AKS HS VAQ AG V 
QWCSLGS LQA 


12008 


25909 


A 


12116 


120 


3 


TQIWGPFLTFFFFFFFEAKSHSVAQAGV 
QWCSLGSLQA 


12009 


25910 


A 


12117 


231 


1 


FLSPFSGYSAITKLIiSLSYFLLGHIIiLL 
LGS T P E AAAQ WQ W VS FADS D I VP PAS T 
WVFPTLGIMHHNKQATENAKE 


12010 


25911 


A 


12118 


216 


1 


L I FPPLLNFC YGE VLGFQGFLALFFACM 
FFWPRFSPKTFFFFFYEKEFHSAAQAGV 
QWCDLGSLQPLPRGFK 


12011 


25912 


A 


12120 


267 


1 


KKKKKAFP S P PLLGC F I RLQKHL LG I LV 
YRSHLISSLIiCLEGIILSLFIIATLITL 
NTHSLIANIVP IAILVFAACEAAVGLAL 
LVSIS 


12012 


25913 


A 


12121 


167 


372 


ISGQDLLKETKRVKRPFQQDDVPFINIF 
VP T PGAPRS LRQ I LE LRGKTD P S P 1 1 VG 
DFNTLFSALtDRS 


12013 


25914 


A 


12122 


124 


2 


GLLKFNIFAPFFFFFSETESHSVTQAGV 
QWCDLGSLQPPP 


12014 


25915 


A 


12123 


121 


1 


R F TAAS ANS GAVS AAVG YMGD P G S E 1 1 E 
SVPPAGPVSSW 


12015 


25916 


A 


12124 


176 


2 


RDIiRGFSRFSNYKFWNFGRVGGI FKFVF 
GKG VKR FFFFFFETESCS VAQAGGQ WRD 
LG 


12016 


25917 


A 


12125 


137 


2 


KLGFFGGGGGKIFAPQKNFFFFFFFFFE 
ME S CS VAQAGVQWRD I G 


12017 


25918 


A 


12126 


103 


3 


LE ENLGNT I QD IGRGKDFTSKTP KAMAT 
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nucleotide deletion, \— possible 
nucleotide insertion 














KAKID 


12018 


25919 


A 


12127 


212 


351 


LXTFEILNICLGDNILDLGLARWLTPVI 
PALWEAKAGGS PEVRSLR 


12019 


25920 


A 


12129 


187 


3 


RGSTMQQTNTRTPGVRVLYYILMIATMA 
I P Y I PMANLLLL I VP I L I AMAF I MLTER 
KILGY 


12020 


25921 


A 


12130 


13 


329 


ASRVTIRVMASNSTKSFLADAGYGEQEL 
DANSALMELDKGLRSGKLGEQCEAWRF 
PRLFQKYPFP ILINSAFLKLlADVFRVGN 
NFLRLCVLKVTQQSEKHLEKI 


12021 


25922 


A 


12131 


231 


330 


MDTDE I YLGRAPWLTPVI PALWEAKASG 
SPEVR 


12022 


25923 


A 


12132 


95 


2 


KGKCFPPFFFFETESRSVAQAGVRWRDL 
SSL 


12023 


25924 


A 


12133 


129 


1 


DLALLPRLWCSAMIMAHCNSELMSSSDP 
PISASQIPGTIGMCH 


12024 


25925 


A 


12134 


196 


361 


TIMEYYAAMKKKSESPLLHFQGKKLNQD 
GGWAWWPTPVIPALWKAKTGGSLEPRS 


12025 


25926 


A 


12135 


120 


1 


PP P PGLFFFFFFETES YS VAQAGVQWCN 
LGSLHPPPPEFK 


12026 


25927 


A 


12136 


235 


3 


KKG I R I E RD RLKQL S VCQR FQ I±AKF P P F 
PSGVGEKQNFFFKKKKKKKGRAQWLTPV 
I PALWEAEAGGS FEVRSSRPA 


12027 


25928 


A 


12137 


185 


1 


IWCPFFCPGPQICSFRCHFFFPFFFFFF 
LEAETRSFAQAGVQWCDLGSLQSSWVTE 
LAAAS 


12028 


25929 


A 


12138 


158 


1 


LRRGGVFS I FFCGGTMVLS PADKTNVKA 
AWGKVGAHAGEYGAEALERMFLSF 


12029 


25930 


A 


12139 


76 


3 


KVIiARAI RQ E KE I KD I Q I G KE E VK 


12030 


25931 


A 


12140 


157 


2 


FFSSPRIiLKRRPGNFLGARENFFFFFFS 
ETESPSIAQAGEQWRDLGSLQAP 


12031 


25932 


A 


12141 


234 


388 


LSQLMS ANLFFVLLFEAE FHS VNRLE CS 
GMIWAHCNLHLPGSSESPASTSQ 


12032 


25933 


A 


12142 


112 


453 


LGRRQ AASMREG I S I HVGQAGVHIGNAC 
WELYCLEHGIQPDGHMPSDKTIGGGDDS 
FNTFFNETGAGKHVPRAAFVDLEPTAID 
EVCTGTYRQLFHPEQLITGKEDAANNYA 
RG 


12033 


25934 


A 


12143 


44 


443 


AKLGTRKLP L KAKMGKE I TL IN I WI GH 
VDWGKTTTTGHLIYKCGGIDKRTIEKLE 
KEAAEMGKGS FKYAWVLDKLKADRERGI 
TIDISLWKLDTSKNYVTI IDAPGHRDFI 
KNMITGTSHADCAVLIDAAGV 


12034 


25935 


A 


12144 


3 


386 


REAATMREC I C I H VGHAG VQNGNAC WE L 
YCLEHGIQPDGHMPSDKTIGGGDDSFNT 
FFSETGAGKHVPRAVFVDLEPTGIDEVR 
TGT YRQLLHP VQL I TGKEDAANNYARGH 
YTIGKE I IDLVLDRIR 


12035 


25936 


A 


12145 


3 


386 


AGAT Y I DRLRVALFE AT T F TLWVRP E L S 
SGEATTMRECMS I HVGQAGVQNGNAC WE 
LYCLEHGVQPDGHMPSDKTIGGGDDALN 
TFFSETGAGKHVPRTEEVDLEPTEIGEE 
GTGTYRQLFHPEQHMM 


12036 


25937 


A 


12146 


3 


385 


GRATY I DRLRVALLEATTS TLRBRRELG 
S RE AT TMREG I C I HVGQAGVQNGNACWE 
liYCLEHGIQPDGQMPRDKTIGGGDDFFN 
TFFIETGAGKHVPRAVFGDLEPTDIGEI 
RTGTYRQLLHPEQLI 
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nucle- 


E=Glutamic Acid, F=Pheny1alanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \= possible 
nucleotide insertion 


12037 


25938 


A 


12147 


11 


381 


HTPEGRTRVPVLIAGVVYCQEALRDWGR 
VTASSTGAMAFLRSMWGVLTALGRSGAK 
LCTGCGSRLRS P FS F VYLPRWVS S VLAR 
CPKKPVSSYLRFSKEQLPIFKAQNPDAK 
TTELIRRIAQR 


12038 


25939 


A 


12148 


321 


509 


YSMDQYFIPFYCQIMHLCFFLKTRVSLL 
FPWLECSGATSAHCNFCLLSSSNSPASA 
SRVTGIT 


12039 


25940 


A 


12149 


51 


400 


AATMRE C I S I HVGQAGVH I GNAC WE L Y C 
LEHGIQPDGQMPSDKTIGGGDDSFNTFF 
NETGAGKHVPRAVFVDLE PTVI DEVRTG 
T YRQLFHPDQL I TGKEDAANNYARGHYT 
IGKE 


12040 


25941 


A 


12150 


25 


399 


EATTSTLGLRHQLGSRESTAMRECIS IH 
VGQAGVHI GNACWEL YWLEHGLHPDGQM 
PGDQTIGGGDDSFDTFFSETGAGKHVPR 
AVLVDLEPTVIDEVRTGTYRRLFHPEQL 
ITGPEDAANNYAR 


12041 


25942 


A 


12151 


1 


402 


TKEEELCLACRQRRADGGWRSQMNAGSD 
P VG I VSAART 1 1 GS FNGALTAVPVQDLG 
S TVI KKVLKRAT VAPED VS E VI FGHVLA 
AG CGQN P VRQAS GGAG I P YSVPAWSCHM 
ICGSGLKAVCLAVQSIGIGDST 


12042 


25943 


A 


12152 


3 


400 


TDRLRVALKKTTTS S FCFPHHLGSRETA 
TMRECIS IHVGQAGDHIGYACWELYCLE 
HSLQPEGQMPGDKTIGGGDDYFNTFFSE 
TGAVKHVPRAVFVDIiEPTVIDEVSTGTY 
RQLFRPERL I TS KEDAVNI Y 


12043 


25944 


A 


12153 


2 


398 


GRCALDRNTF I FARFGCYL I ASGHPGE K 
LMDMDMS P LRPQN YL FG C KLKAEND YHF 
I VANDENE HQ I S LT TAS LRAGANDDMN I 
VEAEAMNYEGTP I KETLATLKMS VQATD 
SLGGSEITPPSVLRLKCGSR 


12044 


25945 


A 


12154 


203 


405 


FTCPSRIICVISRRISPCCAPDLNPMLS 
ANAMLDFVFTVEDPGGWDSKNLQKKWSH 
YSFLTGIRPKI 


12045 


25946 


A 


12155 


229 


397 


TKI AHHKAGFAL I S KKITKNLKMFLS KF 
L P I HALW VTGS S GMQ P YP L VWGH YDLG K 


12046 


25947 


A 


12156 


2 


399 


GGVPHC VWATAWGMRP GL P G P TGL CAQT 
S SRGQKSVDKQKES CGIWQLYHFLSRKQ 
E PRWE P C VSGS S SGDGAVADLADELRG Y 
PALCCTLPVHSYRSWAGIRPQIMNGPLH 
PRPLVALLDGRDCTVEMP I L 


12047 


25948 


A 


12157 


85 


414 


ALLPQSEALQGAVTMPHSYPALSAEQKK 
EVSD I ALRIEAPGKS I LAADE SVGSMAK 
RLSQIGVENTEENRLLYRQVLFSADDRE 
KKC I GGV I FFHETL YQKDDHGVP FVR 


12048 


25949 


A 


12158 


85 


407 


GLLPHSEPLQRAVTMPHSDPALYAEHKK 
DVSDIALRIGSPGKGIIAAYESMGSMAN 
P LNQMG VENT KQNRRL YRQDL FRAEDRE 
KKSLAGVI FFHDTLYHKDDNGVP 


12049 


25950 


A 


12159 


85 


406 


VLLPHSEALEGAVTMPHSYPALSGEHNK 
EliSDIALRI VAPGKGI IAADES VGSMAK 
RLSQIGVENTEENRPLYRQDLFSADDRG 
KKSIGGDLFFHDTLYQKDDNGGP 


12050 


25951 


A 


12160 


275 


146 


EKTFI ISFFFFFETESHSVAQTGVQWHD 
LGSLQAPPPWFRRIA 


12051 


25952 


A 


12161 


212 


2 


KRSRPFFFFFKEMGSCYIAQVGMQWLFV 



1273 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 
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peptide 
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Predict- 
ed end 
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peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GT I TAG YDPELLGS KDP PTS AS QVTGNT 
GTHTHTQLIFVFLV 


12052 


25953 


A 


12162 


253 


2 


GRVD SEC CLANGNAE KS I F C C LLVYL FY 
LSSRQSHSVAQAGVQWHDLGSLQPPPPG 
FSLLSRVAGTTGARHHDWL I FVFLGET 


12053 


25954 


A 


12163 


228 


1 


liLGLWFEQLGRFTKIAKTKGERSQTQSG 
HSMDMRVPAQLLGLiLLLWFPGSRCDIQM 
TQSPPAVSASVGDRVIITCR 


12054 


25955 


A 


12164 


122 


1 


RLLFFETESRSVARLECSGAISAHCSLR 
LPGSGSSPASAS 


12055 


25956 


A 


12165 


239 


350 


GQARWLT P V I PALWEAKAS KSLEVRS LR 
LAWAGHGGS 


12056 


25957 


A 


12166 


235 


359 


TDKAISKRDLSVLREIIFFEMESRSVAR 
MECSGVISARCNL 


12057 


25958 


A 


12167 


209 


396 


QFSGIPFDQYKCGFKNYIYIYIYIYIYI 
YIYIYRKYTRAYTKRYMCVFSRATHNIV 
YLCAYI 


12058 


25959 


A 


12168 


327 


130 


GGVGFLGGPGEKPPRPSPFFFFFFEKES 
RSVPQARVQWPDFGSLPAPPFGFTPFLR 
ESFLFSSLMI 


12059 


25960 


A 


12169 


317 


422 


RGEKPLFFFLQKKKKKKKKKKKKNIKKG 
GRRYKPS 


12060 


25961 


A 


12170 


260 


375 


EKVTCSWTQWLTPVIPALWEAKAGRSLE 
FRSLRSAWAI 


12061 


25962 


A 


12171 


304 


3 


KFFFFFKGFFFLGGVGPIFPPPKKRFFS 
KIPPGVFFFPPLKKKIFFFFPPVILGPP 
RVFFKGAPLFFFFFFFFFFFSETEFRSC 
CPGRLECSVAVSAHCK 


12062 


25963 


A 


12172 


105 


3 


L I FLRQS S ALLPRLE GNGAI SAHCWLCL 
TGSSD 


12063 


25964 


A 


12173 


146 


17 


RIFFFFFLGQSLAVAQAGVQQRDLGSLK 
APPPGVHAILLPQEY 


12064 


25965 


A 


12174 


234 


2 


LFPRKPKPSVSLSLSLSLSLFRLRKLSS 
FMRHNN I EVRP INNPAMTS KCSSERKS L 
VSFTLNLKPGMIRLSEEGMLK 


12065 


25966 


A 


12175 


292 


380 


VRGLGRVAHTCNPSTLGGQSGW I TGGQE 
F 


12066 


25967 


A 


12176 


109 


1 


GRIKKVT KGQAQWLMPVIPALWEAKVGG 
SPKVRSSR 


12067 


25968 


A 


12177 


170 


291 


LINFYFYFLRQSHSVTQTGVQWCNLGSL 
QPPPAARRRRRG 


12068 


25969 


A 


12178 


260 


380 


LTYIALFIFIiRRSLTLLPRLECSGMISP 
HCNLCLLGSSDS 


12069 


25970 


A 


12179 


54 


166 


P KRGFHRVT QEGLNLLTLGS PRLGLPKG 
WDHRQDPLP 


12070 


25971 


A 


12180 


273 


359 


APAGHGGS CLQSQHFGRLRQADHLGSGV 
R 


12071 


25972 


A 


12181 


27 


225 


IGQAQWLTPVILAFWEAKAGGSLEVRSS 
RPAWPIYFLVLLASYLRRFCLIQGHSDL 
LYTSSTRGS 


12072 


25973 


A 


12182 


1 


155 


VHKIFIAYKYSGFQTVFRGRAWWLTPVI 
PALWEAKANRS PE VRS SRSGRRG 


12073 


25974 


A 


12183 


91 


2 


FNLFFFFFFETGSRSVARLECNGAISAC 
C 


12074 


25975 


A 


12184 


122 


2 


GQAQWLTPVI PPPWEAKVGKS PE VRS SR 
RTRGRTRGRTRG 


12075 


25976 


A 


12185 


389 


20 


TDRGRRARRACFCGKVFDGELSFALKLA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REMGRPDWRAMLAGMSSTEYADWHRFYS 
THYFHDVLLDMHFSGLTYTVLSLFFSDP 
DMHPLDFSLLNRREADEEPEDDVLMQKA 
AGLAGGVLFL I 


12076 


25977 


A 


12186 


180 


393 


LNPGSGGCGEPRSRHCTLAWVTERESVS 
KNKI KI KINKQ I EEGFL FQP FG QGF I WC 
YFPAKE F 1 1 ALS KT 


12077 


25978 


A 


12187 


37 


443 


PDFP I PFPPRKVQHLARPS PKTARPAWT 
I RR I L P P P P KKRHP PGLARGKFGPRE LA 
KAGLAKE CPAGG P T PAVR FAT P TRGGGG 
RVAGFQTPMGI PMGKARLGLLTFLGFAS 
GWIGAYRPRETLCGGELVDTLQF 


12078 


25979 


A 


12188 


3 


411 


AFPENAATGSTFQDP VPAS KGRAEVGNM 
RLSVAAPISHGRVFRRMGLGPESRIHLL 
RNLLTGLVRHER I E APWARVDEMRGYAE 
KL I Y YGKLGDTNQRAMRMAD F WLTEKDL 
IPKLVQVLAPRYKDQTGGFTRMLQ 


12079 


25980 


A 


12189 


2 


414 


QE FGTR KRAT F I S LLF F FS S VY S RGVFR 
RDAHKSEVAHRFKDLAEETLRALTLIAF 
AH YLHQ C P FADHVKLGNE ATE FAKT C VA 
DESAENCDKSLHTLFGDKLCTDAPLRET 
YGEMADCCAKQEPERNECLLQHKDD 


12080 


25981 


A 


12191 


3 


514 


PRLLMEAGPHPRPGHCCKPGGRLDMNHG 
FVHH I RRNQ I ARDD YDKKVKQAAKE KVR 
RRHTPAPTRPRKPDLQVYLPRHRDVSAH 
PRNPDYEESGESSSSGGSELEPSGHQLF 
CLEYEADSGEVTSVIVYQGDDPGKVSEK 
VSAHTPLDPPMREALKLRIQEE I AKRQS 
QH 


12081 


25982 


A 


12192 


182 


3 


RGLFFIPPPPIKTKIWGPFWKWFMGFG 
PPLFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12082 


25983 


A 


12193 


53 


400 


PETPSWLGPVRRFYFIIIKSVWKWKKKK 
KKKKKKKKKKKKGGGPFKKTLGGDKFYW 
GVKKKIFFFLGGSKKHPWGFFEKKLFFG 
GGKCGAP P PKD I S CLWGKKNFLGAIGE K 
TCCC 


12083 


25984 


A 


12194 


182 


3 


KGIFIQIAPPKKKKNWGPLWKWFKGFG 
HPIFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12084 


25985 


A 


12195 


200 


378 


KSSKSQWLGYLFIGLLIYLFIKFFLETR 
SCFVTQAGILAHCNLKLLGSSDPPTSAS 
QAA 


12085 


25986 


A 


12196 


341 


54 


SLSFHGGLSVLCIFSTYKKLSQEQWLTP 
VISVLWEVKVSRSPQVRSLRPGDPPASA 
SQSAGITGVSHHARPHLSLNPRLEMIKL 
SEEGMLKRMTG 


12086 


25987 


A 


12197 


1 


462 


GGPPRPFRMKAAVLTLAVLFLTGSQARH 
FWHQDEPPQSPWDRVMDLATAYVDGLKD 
SGRDYASQFEGYALGKQLNLMLLDNWDS 
VTYTFS KLREQLGP VTQEF WDNLDKETE 
GLTHEMSKDLEDVNAKVQTYLYDFQKTW 
QKELTFTAESEPVP 


12087 


25988 


A 


12198 


80 


1 


SNKVFFLETESCSVAQAGVQWCDLGS 


12088 


25989 


A 


12199 


221 


3 


TMGIMLDKKE IQAI FS FKFKMGHAVAET 
TCNINNTSGPGTANKVTVQWWFKKFCKG 
DESLEDEEHHGRRLEV 


12089 


25990 


A 


12200 


239 


336 


FGALPITTCKFGKKKKKKKKKKKKKKKK 
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Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, I=Iso leu cine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKK 


12090 


25991 


A 


12201 


279 


366 


KKKKKKKKKKKKKKKNKKKKKKKKKGFK 


12091 


25992 


A 


12202 


153 


40 


GFLFLIFFFFETKSRSIIQAGVQWCHLG 
CTLAWATETD 


12092 


25993 


A 


12203 


317 


406 


LGKLKCNGAVSGHCNLRLPGSSDFSGSA 
S 


12093 


25994 


A 


12204 


2 


925 


RSGPDHELTLDSGKGCTRSMVPASAASE 
DRRKLPIIVEDEGGPTRSRACSSPARGS 
RPPPSAIGCSPVAQASDSAAGPARRTAL 
QSLSSWLGYQIDRHSVPVYVFKSPLFSV 
I MAP KH KS S DAGNLDR P KR S RKVLPL S E 
KVKVLDL IRKDKKS YAE VAK I YGKNE S S 
IRE I VKKEKE I RAS FAVS P PTAKVTATV 
RDKCLVKMEQALHLWEEMNRKRVP IDS 
NMLRQKALSLYQDFSKGCSETDTKPFTA 
SKGWLHRFRHRFSHHYKKKKKGIMAQVA 
VSTLPVEEESSSETRMWTFLVSALESM 


12094 


25995 


A 


12205 


272 


1 


KMARAGLLVI EGKVWRTVYYRFATRE ER 
EGKMSTNLMNKLDTIGFDNKKDLLISVG 
DLVDRGAENVE CLE LIT F P W FRAVRGNH 
ERQPGQ 


12095 


25996 


A 


12206 


166 


478 


ISASGLLPTSPLTGTSKLQDPNEHLNLL 
MLNRVS L LL P RL E CNDT I LAH YNLRL P G 
SSNS PDS AFQVAGI TESFTCE INALKDS 
S QVALWS FRCEMDHS S I CNR 


12096 


25997 


A 


12207 


102 


497 


PLLALLFGSQKTFLCLQLSLKPTRSSLL 
S PGSAGNPENEAP CPQLNPEATS LKKKK 
KKKKKKKKKKKKKKKKKKKKKARG 


12097 


25998 


A 


12208 


131 


2 


FGFLGGDKAKGKGPGPPFFFFFETESPS 
VAKAG VQWCDLGS LH 


12098 


25999 


A 


12209 


221 


499 


Q TRHKLTNS R S GRTGLQ I L I KE QHS S GR 
WCGTGDILWQSLEDTICYSVPKSLLDFL 
LFIYFETESHFVTRLECSGAISAHCNLR 
LLGSSDSPA 


12099 


26000 


A 


12210 


297 


398 


HNFFFFLKTESCSISQAGGHWYDLSSPQ 
PPPPG 


12100 


26001 


A 


12211 


341 


452 


NWSPGLKQSSRLSLPKCWDYRREPPCLA 
GLLAYLFRR 


12101 


26002 


A 


12213 


214 


468 


FFFFFFWGKGVFFFFPPLEGRGGNSVYW 
T P P PRG KKNF P P LGLGKMGNNP P P P P PR 
F FF F F F KKKGVF FLGP GG VKT P S LRE T P 
P 


12102 


26003 


A 


12214 


223 


477 


CPTETQLQLIFAREKHKPSPVLLDLDLY 
GFFFEMESHSVS PLECSGP I SAQSNLYL 
LGSHNSPA 


12103 


26004 


A 


12215 


263 


55 


TPKRYGKMQTATYLPKSKIYIIISDLRK 
QD Y I VYKQ S KKP EGWAQWLT P VI PAL WE 
AKAGRSPKVRSSR 


12104 


26005 


A 


12216 


328 


471 


GVIHAQVLHAHTHTHTHTCTHTQVHAHS 


12105 


26006 


A 


12217 


434 


1 


WDHSAS PPTSVGTPLRADGPHPRLTAPG 
HWGSSAMLSAPAAGRRGLSLGRWCDRD 
KLSQRGPQPKWILWWGNLPGGGFLHRHI 
HKPLPT I P I PST VRGEEVLQQLLHS AS F 
ILFIFI FLDRVS LCHPGWS AVARS RFTT 
TSTS 


12106 


26007 


A 


12218 


304 


43 


EKPIGVAPSRVKKRAGGGEKTECCLLKG 
PLQG FQG VL KG P LKLNGPL F F F FFLRRS 
LALFRLECNGT I LAYCNLCLLGS S DSPA 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \— possible 
nucleotide insertion 














SEI 


12107 


26008 


A 


12219 


171 


365 


ALVQPRAGREAVTVPACCS CSDVFLYET 
NKVARIQSINYGTIKFFHVIVFSYVSKE 
HLKRKLEK 


12108 


26009 


A 


12220 


171 


452 


ARYKFLHP KFAFGLYP SSSQI QGARHQL 
WILIKCSFYLLFLRQCLTLLPQLECSNM 
ITAPLQPQLPGLKQSSHLSLPSSWDYRH 
EFP 


12109 


26010 


A 


12221 


323 


458 


FRVQRSIiFLFIYFFETESHPWQAGVQW 
HDFGSLHRISP 


12110 


26011 


A 


12222 


189 


56 


DRVSLVQPTLECSGTISVYCSL.NLLGSS 
DPPTSASPVASTTGGI 


12111 


26012 


A 


12223 


246 


494 


FLCSSYAYDDHFLRPFFPIHIilLDITAF 
SQDTI ISNLLFQDACSVPKKAANLGQAQ 
WIA.PVI PVLWEAEVGRLLEVRS SRPAW 


12112 


26013 


A 


12224 


55 


353 


RIQGCSCWVKLCPKVEKWVQRTDAEQES 
QTKAE I QDMKQELSAVNMMDKFARSARL 
ERKINKMMNKLKTHVKVQTAQSGMLKWV 
ISVAFYKLPGTVIRL 


12113 


26014 


A 


12225 


199 


23 


LKLSS IYYVPGTILKYFTGWARWLLPVI 
PALWEAEVGGS PEVRSSRPANS TSW I ER 
HQ 


12114 


26015 


A 


12226 


369 


14 


DCVRVGLSYPQTVCPSCESVRKAPFSSR 
PHRGVLESGQPRCKSRRVDIFFSMERER 
DRERERERERERERGQKRTRRVWGGERE 
KLGYSHAWGWRGGRGLGWGGMCGIPPPC 
TSDVHQ 


12115 


26016 


A 


12227 


206 


2 


QNGHMNGGP CRVS L S LFLETRS FLF FLA 
SLSMLECSGAILiAHCNLRLPGSSNSPAS 
AS Q VAG I TGACH 


12116 


26017 


A 


12228 


231 


501 


RSWEGQAIjHGSDPLASCTRIQSNYMALQ 
RINQELEDKLYRMGQHYEEEKRALSHEI 
VALNSHLLEAKVT I DKLS EDNVS AAHTW 
PMEDCR 


12117 


26018 


A 


12229 


86 


455 


DLPTFPVPPHPGFSGPLLGQSELPSNCQ 
TGGRSHLSVSVLCPTLATTPLGliDFPTC 
GTSQLPIiGPLQLAHRSELAGLSIjPGLST 
NCTSQRLPDRPHRHSDAPHTHTHTQTHT 
HTHTHT PATVT 


12118 


26019 


A 


12231 


211 


96 


KSHFKAGGGGPPRYS PPFGGGGGGVFRV 
GGFPPPGLTQ 


12119 


26020 


A 


12232 


228 


3 


KRWFIKGFLSKGLGPPPPKLKFLGFFFF 
FFFEQS FVLVAQAGVQLE CSGT I LVHFN 
LYLPGSSNYPASASHPRV 


12120 


26021 


A 


12234 


137 


3 


VPLPGTLKNHSMVPGFLFVCLFVCYETE 
SHSVAQAECSGAISAH 


12121 


26022 


A 


12235 


139 


1 


FKLCFGQAWWFKPI IPALWETDASRSLE 
VRS SRPADAWADAWADAW 


12122 


26023 


A 


12236 


80 


1 


VLLIFIFLEEMGFHHVGQDGLDLLTL 


12123 


26024 


A 


12237 


187 


494 


TERNS FN INKKD I HTKTPF VGYQHQRP K 
VDKTTKMGRNQSRKAENSKNQSTSSPPK 
DRSSLPATEQSWTENDFDELTEVGFRRS 
AITNFSELKEHVLTRRTE 


12124 


26025 


A 


12238 


252 


503 


CQGVGADDLQSQAE IQCELCAES IDRFD 
ILFFFFLGLGPPHKNPPPPTPQKWKPGK 
APPIiALWD I PRGAKGPGRQ PGRPAI HTC 


12125 


26026 


A 


12239 


2 


471 


RIALCPAVRIRHEERERERERERERETP 
GHTQLSPGARRTPPLERERAFNDCFSLN 
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ing to first 

amino acid 
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correspon 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leuctne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














YPGNL C PGDL I E VFRS VYQHCAL YLGDG 
YFINITPVYGIPAFFTSAKSVFFSRALE 
KTHLLKDWGSDTYRINYICYETLPSLS 
VEEI ITRSEFVIGQEA 


12126 


26027 


A 


12240 


14 


149 


FMTACR I RHEMGRGLLRGL W P LH I VLWT 
RIASTIPPHVQKSALPH 


12127 


26028 


A 


12241 


588 


671 


RSRPSFQEQAFESSQKYKEGKYI IELNH 


12128 


26029 


A 


12242 


384 


464 


RGGQVWWIjKP WPALWEAKAGRS LEAG 


12129 


26030 


A 


12243 


325 


459 


TYLFIYLFIYFETESCSVAQAGVQWRDL 
VSLQPLPNSPALPTN 


12130 


26031 


A 


12244 


323 


457 


S LQRE CGPADT F FEMD S CS VAQAG VRW C 
DLGSLQNSP 


12131 


26032 


A 


12245 


241 


466 


GALFFFFFFFFFFFFFFFFFFLSNKAVY 
F 


12132 


26033 


A 


12246 


385 


37 


Q ARRRCLQTSRME ALGRWS YKVT IEFPL 
QILFFPPILKYKVIFGIKITPNFHFLEE 
YEEGQRLGKSTVSWVFNEDTRL INI WNR 
HITVNPRTNSQNRMLKMAHESKYSQVPF 
PLQA 


12133 


26034 


A 


12247 


318 


456 


L ILKYFMYVCMYVHMYVCMYI KS CSVAQ 
DGVQW YDLGSLQ P P P PGL 


12134 


26035 


A 


12248 


362 


473 


S PCQNNMNSAKTEARTNIKFMAKLMWKN 
GAIIDALQK 


12135 


26036 


A 


12249 


310 


463 


FS I KTRKNALGKLQS LiLRCHQ F YL YVC V 
CVCVCVCVCVCTRTYAILTCIAV 


12136 


26037 


A 


12250 


169 


54 


EAKSGQARWLTLVIPALWKAKAGGCPEV 
RSSRLAWPIW 


12137 


26038 


A 


12251 


49 


449 


GIPATSTSCVQVILLPQPPLLLGLQDSF 
LPQEIIIKVEGEDTGSLTIPSQEGVNFK 
I VT VD F TRE EQGT CNP AQRTLDRD V I L E 
NHRDLVS WDLATAVGKKDSTS KQRI FDE 
EPANGVKIERFTRDDPWLSSC 


12138 


26039 


A 


12252 


130 


428 


RWGLAML PRLVLNS W S Q VI L P P Q P QVGA 
T FGG F PNP P GHWKLLGG I S SGE E AG VEE 
AEEGQALGFLGQFPPSSPQLLLQTSWCL 
RTTRTCRWSLL.WKAH 


12139 


26040 


A 


12253 


204 


54 


PGMHKTQKSDLGQVQWLTPVILGLWETE 
AGDSPESRSSRPAWAKHKIVR 


12140 


26041 


A 


12254 


375 


470 


S L CL F F S F F FQT E PRC VAQAG VQWRHLS 
SLQP 


12141 


26042 


A 


12255 


432 


512 


GQVRWLT P VI PALWEAEGGRS PEARS S 


12142 


26043 


A 


12256 


232 


1 


KGALSKKTQSFWGGQKFFPRPGVFKGVF 
FWGGGFCFFVFGGFFFFFFFFETESCSV 
AQAGVQWCSLGSLQPLPPRFK . 


12143 


26044 


A 


12257 


22 


123 


RFVCST I KVLRDLSSDRSNPGRFLSTSN 
SSSTRP 


12144 


26045 


A 


12258 


71 


362 


QTERNS I S INKKDVHTETPS KGHQHQRP 
KVDKSTKMRKKQRKKAENSKNQNASSPP 
KDHNSLPAREQNWTENEFDELTEIDFRR 
WVITNSSELKEHV 


12145 


26046 


A 


12259 


375 


3 


SPNIEAPPKAFFFFNQRHFVFCFFSCRA 
ENTHE I VWVKVY P FSNQS KAKL F FS F F V 
QRDRSSPDSYLMPLQLQMWDTAGQERFR 
S I AQS YYRSANAL ILAYG I TCEE S FRCL 
PEWLRQIEQYA 


12146 


26047 


A 


12260 


98 


1 


KYQ IDLGGRGS SNS PAS ASRVAG I TGTH 
HHTQ 
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peptide 
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Predict- 
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nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine 7 M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12147 


26048 


A 


12261 


125 


3 


LFLIQKYITGQERWLTPVI PAPWEAEAG 
RSPKVRSSSPAW 


12148 


26049 


A 


12262 


228 


385 


S CLYLQMI PSS I EKSQGMYKKATRGRAR 
WLTP VI PALWEAGAGRSPEVRS SR 


12149 


26050 


A 


12263 


227 


369 


MLKHKSNKNLQDLGQMQGLTPWPVLWE 
AKVGGS PE VKNSRPAWPTW 


12150 


26051 


A 


12264 


102 


487 


QQNRFFGTERGRGTGRGVSSFPQLSPGS 
LELASSPSPSVGRLGSAASQRSRGGQER 
PKAQLRRPPRPSSSPRLRGRENCAHHAR 
TPGG I LDRP PP TLGTNLLKGGRP AWMAG 
NPPEAHKTANGPGEQA 


12151 


26052 


A 


12265 


391 


476 


PRSYRVECSGSISAHCNLGLLGSSNPSS 


12152 


26053 


A 


12266 


378 


476 


MSSRLGSLEVGILGRAWWLTPVI PALWE 
AKEGG 


12153 


26054 


A 


12267 


166 


58 


NFFFFFFETDSCSVAQAGVQWHDLSHCN 
LCLPASSW 


12154 


26055 


A 


12268 


209 


49 


TPPNFYI KI SRSGQAQWLTPVI PALWEA 
EAGRSPEVRSSRPASPTRRNS ISNS 


12155 


26056 


A 


12269 


127 


1 


SSSFFPliKTFGFLFFFLCFFESESRSVA 
QAGVQ W CHLGS L QA 


12156 


26057 


A 


12270 


154 


27 


I Q T PGKKE FFFFFFET KS CL VAQAGARL 
CPKKKEKENSNFVL 


12157 


26058 


A 


12271 


180 


22 


VLKIHNIGRAQWVTS VI PELWE VKAGGS 
QALRS SRAARTRWDACNFVEKTQVY 


12158 


26059 


A 


12272 


207 


422 


GDSSTRGADEKPKEGVKTDNN1MLINLKV 
MGQDCSMEQFKIKSHIPAGYGGSHRESQ 
HF VMPRWADHLRS GV 


12159 


26060 


A 


12273 


59 


479 


NSLGGGGVYGSRFRFTFPGCRALSPWRV 
RVQRRRCEMSTMFADTIiLI VF I S VCTAL 
LAEGITWVLVYRTDKYKRLKAEVEKQSK 
KLE KKKE T I TE S AGRQQ KKK I E RQ E E KL 
KNNNRD L S MVRM KS M FA I GF C FTALMGM 


12160 


26061 


A 


12274 


354 


439 


VKTGEGS IAKTWNQPKCPTTIDWIKKMW 


12161 


26062 


A 


12275 


366 


499 


TRAQYFWWGIGSQCVTLECSGMIMAH 
CSLDLLGSSDPPTSAS 


12162 


26063 


A 


12276 


136 


37 


KF PGQVHWLKP VI PALWEVKAGGS PE VR 
SSRPA 


12163 


26064 


A 


12277 


419 


48 


DGKLNVSFLTVSSSGGFLIFLLIVRPPY 
FLRYNNS EIRP INNP TMTCKCSS KRKSR 
I CLTFNQKLE 1 1 KLS EEGMSKI KTGQNL 
G I LCHVS Q WDEQHKFVKATKSATAKIN 
LSISKYIMSGA 


12164 


26065 


A 


12278 


190 


1 


RKGVFSPQMGPGLGKGKSPFLAKLNFPF 
KKTFFFFFETGSCFVAQAGMQWLDPSSP 
DAWADAW 


12165 


26066 


A 


12279 


276 


389 


WFNLSFFCFCFLFFETESCSVARLECSG 
AISAHCNLT 


12166 


26067 


A 


12280 


425 


1 


PPHPPPGKPPPPPRPELFFPKTKNKKER 
VLFFSPPPKiaSTFFSLTTPRFFFSPPQKK 
KKKLNPPPQKGPPPPVPIKPPPPLFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFPFFFFSFGLTVRVIMTCWLIVDIGLL 

I ! 


12167 


26068 


A 


12281 


253 


90 


IWPPQGSFKTAAPFFFFFVFLiVEMGFCH 
WQAGVKLLTS S SP PAS AS QTAE KLF 


12168 


26069 


A 


12282 


61 


387 


PKSSSSAPFQISSWFSLPASTSASTMSI 
RVTQQPYKVFNSGPWAFSSSSYMSRPSA 
HI SSLIVSRVGS SSTS FQGGLGAGMGLP 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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K-Lysine, L=Leucine, M=Methionine, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IjHSSLANRARLSQKKKKRGGRSKEP 


12169 


26070 


A 


12283 


186 


305 


LHHSTVQVREEEHKSLHTQVHTHTHTHT 
HPHLVIMSSHS 


12170 


26071 


A 


12284 


2 


190 


PRVRFDRVNNRSLNPL I SRTMRIEP I PE 
NPKFSVKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGA 


12171 


26072 


A 


12285 


17 


197 


SLGVGGGQGGGGVHFHP CKP F I VLS PVS 
LSKHCSYLPKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKK 


12172 


26073 


A 


12286 


158 


405 


GVASYAEQRKYFLEMEYFPGGEAISIVE 
MTTEDLtHYS INLDDKAPTGFERVYSHFE 
RTCTDDKMLS YC I TCYRE I FCERNKL 


12173 


26074 


A 


12287 


364 


142 


GCAFFLGGPLKERRKRRAVP PL.FFFNFF 
FETE S CS VTQ AG VQWCNLGS LQ PLL I DW 
WGQPGTVAHACHPGTLGG 


12174 


26075 


A 


12288 


171 


380 


VFEAGHAWLQENLTSPDFWGFSFFPTDT 
GMLEKKKKKKKKKKKRKKKKKKDR 


12175 


26076 


A 


12289 


194 


2 


GPRGFWQFWVTKGFFPKKGFVKFYPPAG 
GWFFFFPKRGFPFPSFLFFFDRVWLCHP 
GWISVARS 


12176 


26077 ■ 


A 


12290 


179 


416 


LIFLFTFACAVCFQLARFHFKIHIETAL 
RCLI ILSCPFTHPLYKMYREGQARWLTP 
VIPALWEAEVGGS PEVRSLRPAC 


12177 


26078 


A 


12291 


3 


408 


LAYGFHDRTTYDTPSALMMVLLVDEL IT 
GNKNGSGEAPEYLPEDFRDGEDDAAVTL 
EKQEDLTTLVTLPVTLGEHQRQRETQLE 
AKLLKKRLELGSLLDLVEDLELI IQLKK 
KKKKKKSPLWGAPFKGPLGAPI 


12178 


26079 


A 


12292 


119 


278 


PKSIEAGVKSVLDHPLPNKAKTHLSCKK 
LKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKK 


12179 


26080 


A 


12293 


119 


3 


MLFTGFHTKVMSLNS ITGWAWWFTPVI P 
ALWEAEVGKS 


12180 


26081 


A 


12294 


205 


1 


ILHFYSTBTKQPKGLFLALFYSDCEQEK 
RKGGKLKKIFKKAQGQARWLTPVIPAFW 
EAKVGRS PEVGS 


12181 


26082 


A 


12295 


215 


1 


AFKIKLTMHIFSSPIiCLPQDEFHPFIEA 
LLPHVRAIAYTWFNLQARKRKYFKKHEK 
RMSKE FHGARLEGNQ 


12182 


26083 


A 


12296 


185 


411 


DVPLMFHWLHLHSDFRIISTEIGVETVG 
KRVLHIRILLLLLLLFFEMESCSVTHAG 
VQWCDLGS LQAP P PGFTP F 


12183 


26084 


A 


12297 


281 


388 


RILFFFFFETKFCSVPQAGVQGRDLNSL 
QPPPPGFM 


12184 


26085 


A 


12298 


178 


3 


TLLSVTQVPILPGSASSPLLFSRKDIKI 
PTSVFFSFQTEPRSVTQAGVSWCDLGSL 
QP 


12185 


26086 


A 


12299 


199 


450 


TLIRHMICMYFLLSHRSPFSSVDYFLCY 
REI YSL I KTLNKLGIGGFFLNI I KVMYD 
KPAASIIRSSEKQKAFPLRSGTRIRCPL 


12186 


26087 


A 


12300 


3 


407 


EFKDHSTAMDTEPNPGTSSVSTTTSSTT 
TTTITTSSSRMQQPQISVYSGSDRHAVQ 
VIQQALHRPPSSAAQYLQQMYAAQQQHL 
MLHTAALQQQHLS S S QLQSLAAVQASLS 
S GRPS TS PTGSVTQQS SMVQTS I 


12187 


26088 


A 


12301 


227 


3 


KRGTEGQTLFFPAPRARKSFPPPKMPKK 
VEKEFPKGFFFFFFFETESAPSPRLECS 
GAISAHCSLNPHASAHAS 
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C=Cysteine, D=Aspartic Acid, 
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G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, MNMethionine, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12188 


26089 


A 


12302 


292 


380 


KCYRWARWLMPVI PALWEAKAGGSLEPR 
N 


12189 


26090 


A 


12303 


13 


186 


VDERHIRLSEMAWVALYLLSLLWATAGT 
STQTQSSCCEYDVFPRRSGKQLPWGRGC 
DL 


12190 


26091 


A 


12304 


253 


409 


ITAVIAAAAAWGVTIIMTNFRSLPRLE 
CNGAI S AHRNLLLLGSSDS PASAS 


12191 


26092 


A 


12305 


153 


64 


GRAQWLMPIIPALWEAKAGGPLDVRSSR 
PA 


12192 


26093 


A 


12306 


265 


420 


GTLMSRNTKLQLCRMNKSRTLGQPWWLT 
P V I P T LWE AKAGG S LEAR C S R PAW 


12193 


26094 


A 


12307 


295 


1 


RPCLKNKKTSPHWKKKFANPLFGKAPVF 
PLDKKPFKPHYRESPRPLKKKNGPKGFF 
FFFFETESCSVTQAEVQWRDLGSLQAPP 
PGSYHSPSKLIIKN 


12194 1 


26095 


A 


12308 


247 


421 


SWCCLKKKKKKKKKKKKKKKKKKKKKKK 
PGGGALKKKKIFS 


12195 


26096 


A 


12309 


168 


3 


GQALLLLCLTVAFSKTTWCHAPILGWA 
WWLTPVI PAL WE AE VGRS PE VRRSRP 


12196 


26097 


A 


12310 


117 


3 


VAYN I AVFRGGVS LL LP RLE C S GV I S AH 
CNLRLPCSS 


12197 


26098 


A 


12311 


256 


101 


EPPTPFFFFETGS GS I TQ AGAQ RRDHS P 
PQSPTPGLSNTPAPASRACESGR 


12198 


26099 


A 


12312 


146 


3 


FVALCIQLFPLWVICFHKKHKTGRARW 
LMPVI PALWE AKAGGS PE V 


12199 


26100 


A 


12313 


207 


3 


VRVRTTGVIMALRGLCFSVEESMTKDEL 
I ARLR S LGEQ LNRD VS L TGT KE E LALRV 
AELKQELDDSR 


12200 


26101 


A 


12314 


282 


2 


NCLAEKVKFMVSAI S LQ PQG I S I LFLS F 
FFEKESCSVPQARVQWPNFSSLQSPPPG 
FKLFFCLS PGGGGCS EPKS CRC I PGWQQ 
NETPSQKKK 


12201 


26102 


A 


12315 


298 


84 


KTPQPPVFALSKNFGPPKTAPIFFFFFF 
FETKS YSVAQAGVQWRDLGSLHS SLGHR 
ARLHLKKRKRKS INL 


12202 


26103 


A 


12316 


124 


1 


SKGHFFFFFFSETESCSVAQAGVQWHNL 
GSLQVPPPRFTPF 


12203 


26104 


A 


12317 


132 


398 


WTNAVFS I VT F S P S VCH S E VALAAYKW 
L VC YLLRE T YQ KLNQ EIKPPTLS VP KKK 
KKKKKKKKKKKKKKKDKKK 


12204 


26105 


A 


12318 


262 


392 


PHYCQNQPQARRSGSHQHFGKQRREDHL 
SLGVRDQPGQYSETP 


12205 


26106 


A 


12319 


168 


3 


EAPKAISNCTTALQPRLQGKNLSQNKEG 
QVQCLTPVIPALWEAKVGGSPEVRSS 


12206 


26107 


A 


12320 


48 


385 


DLGLSGHS TLAS S F I SLLLLS YKKCLS P 
S TM I MRL P LP CGS P I KP P FL P KKKKKKK 
KKKKKKKKKKKKKKTGG 


12207 


26108 


A 


12321 


218 


83 


MLSP I FPEGNFFFFFFEMESRS VAQATV 
QWRDLGSLQAPPPRRKF 


12208 


26109 


A 


12322 


197 


1 


KETWPFPPKKEQGLFSFFKKRIFFFFSK 
KIPSPPFFFFFFFETESCSVAQAGVQWH 
DLGSLQAPP 


12209 


26110 


A 


12323 


128 


36 


KGLFFFFEMGVSLLPRLECSGTTSAHCN 
LP 


12210 


26111 


A 


12324 


140 


3 


I LNSNLKP IKKNTFFFETESRS IAQAGV 
QWHDPSSKQPPPLGFKR 


12211 


26112 


A 


12325 


273 


369 


RKEWFLKKKKKKKKKKKKKKKKKKTKKK 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKK 


12212 


26113 


A 


12326 


140 


415 


RTGLLGPGRLQGPPYGSREYKGSGLLDL 
NASLLDPEVWAPSLSGPETKPPAHSPSD 
PGVHSSCSASQGSQGPGPLTPSSRIHEP 
KPPGPFFK 


12213 


26114 


A 


12327 


2 


506 


FVPLSETEAPGCSGSRPPELEPERSLGR 
FRGRFEDKDEQLEEEEELEEEEEEEEED 
MSHFSLRLEGGRQDSEDEEERLINLSEL 
TP YILCS I CKGYLIDATTI TECLHTFCK 
SCIVRHFYYSNRCPKCNIWHQTQPLYM 
IRLDRQLQDIVYKLVINLEEREKKQMHD 


12214 


26115 


A 


12328 


1 


419 


FRVPSCARCQYLPLLRGASQLGYSGVRD 
P VE E AVC P F S DLKLCAGRT TTL F KAVRQ 
GHLSLQRFLLPFVWLCPAPRSGVYRGRQ 
ASLSCGGLHPVGASWPRCLPTQASAMAG 
TPPPASLPPCSLISDCCASNERGSMGV 


12215 


26116 


A 


12329 


259 


414 


NMENSLRCVWVPKLAFVLFGASLLSAHL 
QVTGFQIKAFTALRFLSEPSDAVT 


12216 


26117 


A 


12330 


302 


404 


SCYRVSVISQARWLTPVIPALWEAEVGR 
IPEVGS 


12217 


26118 


A 


12331 


94 


375 


SRDMS PGLLTTRKEALMAFRDVAVAFTQ 
KEWKLLSSAQRTLYREVMLENYSHLtVSIi 
GIAFSKPKLIEQLEQGDEPWREENEHLL 
DLCPGWSAMA 


12218 


26119 


A 


12332 


7 


246 


NP IVDPS PCGG I RVRT P AGRGG PALRVR 
PETWEEAGEKMPS E S L CLAAQARL DS KW 
LKTDIQNCFITRKISLLPLFCHHL 


12219 


26120 


A 


12333 


150 


1 


YIYFFHLVRGLPGSGYDEYFWWVNTE 
S PSVTQAGVQWCHWSVQHKPP 


12220 


26121 


A 


12334 


34 


426 


E PGFLFVFFFL I RGKKTPLFFWGTRKNP 
KSGKPLFTPPGKSPSEPPKSRGGPIjFFQ 
TRSRLTPQSWFGGAPKPPLGGEPPKFWE 
AGLGGDPPTPRAPHRADQTLEQPIQAPT 
RI IQNEQWAI KP FQ PMPGG 


12221 


26122 


A 


12335 


185 


3 


TVITPLHFSLGHRARLWSRVPAPPAENR 
SMEGGLGRAVCLLTGASRGFGRTIAPLL 
ASHE 


12222 


26123 


A 


12336 


60 


316 


GWGP I PS TDVTVLS PQKPI LLQGHERAI 
TQ I KYNRE GDLL F T VAKD P VS VGWRGS G 
RGGILLPGRWVDTPVLPLAGNQRASHCE 
K 


12223 


26124 


A 


12337 


3 


167 


S FR I Q VQGNHTS KH Y P K I Q Y S D Q AQWLT 
SVIPALWEAKTGRSLEVRSLRAFRAP 


12224 


26125 


A 


12338 


238 


401 


RPLSVMCGRIiTS I RVEDSGARCWF YLF I 
YETEFRSVAQAGVRRRDLGSLQTPPSG 


12225 


26126 


A 


12339 


354 


433 


GWAQWFMPVI PALWEAKAGVS PE VRS 


12226 


26127 


A 


12340 


238 


382 


LL I VYNMMLLPDAVAFTCNS S TLGGQGR 
R ITRGQE FE TSLANM VKPHL 


12227 


26128 


A 


12341 


101 


1 


KTKQTNKKTRSVGQTWWFTP 1 1 PAYWEA 
QAGEL 


12228 


26129 


A 


12342 


276 


23 


GSCLLEGKLTNRKDIHTKTPSVRHHCQR 
PKIDKTTKMGRNQSRKAENSKNQSSSSP 
SKECSSLAATEQSWMPHDFDELREEGFR 


12229 


26130 


A 


12343 


122 


1 


YMGVNERGCGQS I FKSSLSSQLWWWAP I 
IPATQEAEAGEW 


12230 


26131 


A 


12344 


180 


360 


LHWYFFSVGDFLPPPPPPLDDSSALPS 
I SGNFPPPPPLDEEAFKVQVRAEVKVML 
GKS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=JLeucine 7 M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Ghitamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12231 


26132 


A 


12345 


35 


411 


TMT IVDKAS ESSDP SAYQNQ PGS S EAVS 
PGDMDAGSAS WGAVS SLNDVSNHTLSLG 
PVPGAWYSSSSVPDKSKPSPQKDQALG 
DGIAPPQKVLFPSEKICLKWQQTHRVGA 
GLQNLGNTCFANA 


12232 


26133 


A 


12346 


278 


386 


I IYCYKQT I I GQARWLTTL I PSLWEAKV 
GGSPEVRS 


12233 


26134 


A 


12347 


111 


2 


KSQTGQVQWXiTPVI PALWEAEVGGLLEL 
RSSRLAWA 


12234 


26135 


A 


12348 


215 


316 


LNKRLMISFIKKKKKKKKKKKKKKKKKK 
KKKKKKKGGPL 


12235 


26136 


A 


12349 


265 


404 


SNISYFLNNNFSKALSLNIKYMISQARW 
LTPVIPALWSAKVGGLPE 


12236 


26137 


A 


12350 


372 


1 


PVFPLPPKKLGEQLPPPALRFLAVSPLP 
KAAHEQE I KEKVLAVHKNP I DPVYGFPK 
KGPTNFFLFPLFQRVIFLGAQKGVSWFG 
FYVKGLSLAPKLGGPPFFFFFETESRSV 
AQDGVQWCDLGS 


12237 


26138 


A 


12351 


318 


416 


QGRAQWLTPVILTLWEAKAGGSSEVRSS 
RPAPP 


12238 


26139 


A 


12352 


144 


1 


NGYAIVE CWLFCFVL FLRQS LSVAQAG 
VQWCNLGSMQPPSTSIiVQAI 


12239 


26140 


A 


12353 


116 


1 


S GRYS FKKMKCGQERWLAS VI PALWEAE 
VGRSPEVRSL 


12240 


26141 


A 


12354 


102 


2 


ETRYKKQPGGWERWL.TP 1 1 P VLWEAEVG 
GSPEV 


12241 


26142 


A 


12355 


112 


294 


LIS PS CPMWPS PHGPTWLSRL ISQHSPS 
NLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRGGGAF 


12242 


26143 


A 


12356 


138 


2 


SFFLKVFFFFFFEMESRSVAQAGVQWCN 
LjGSPQAPPPGSRHCPVD 


12243 


26144 


A 


12357 


148 


394 


PGE P PGE KGEGGKGKKPGGRGWGEP P CR 
GGFKEKFPRGKEKAPPLKENSRENLGWG 
TKIPPLGKKKPPPPQEQEMILGPPQF 


12244 


26145 


A 


12358 


97 


3 


EKFSPCFVRARTHTHTHTHTHTLSLSLF 
GFH 


12245 


26146 


A 


12359 


204 


2 


KHYTPAACFAPCLPDEAPI IAAAKPATT 
T S E Q KMAVP P KYANLGKS ARNVFNKG YG 
YGLINLLKTKS 


12246 


26147 


A 


12360 


156 


391 


NRGLFKVEESFS I PCS VRCS I HPSALVS 
PTPPTTDTTNWALFFETESCSIHQTGMR 
WRDLNLLQPLPPGFKRFSCLNL,' 


12247 


26148 


A 


12361 


179 


1 


ALKKLLI IDDNSHKTNIVKHFSFRNFTF 
LFLLETESRSIARLECSDANPTHCNLRL 
SGP 


12248 


26149 


A 


12362 


218 


391 


TILKDCTFKMICLSQAQWHIPWPATWE 
AESRGLLEPTS 


12249 


26150 


A 


12363 


2 


356 


TNSHVDNS ITQKPEFCIPRCCGYI VCQL 
QLVERAFI FNFFFFFLKRSFVLLPRLEG 
RGAI FRVTQEGSNLLTLGPAPLGLPKGW 

LKRNIiL ! 


12250 


26151 


A 


12364 


94 


1 


KNP P LiF FFFFFFESES C YVAQ TAVQWHD 
LGL 


12251 


26152 


A 


12365 


237 


2 


YS SLLFKPAESAKGI LRHHVTP KS TSAI 
SKILFCFLFFVETEFLSPGLKCSGAIKV 
HCSLiNLSGSSNSPTSASQVPGS 


12252 


26153 


A 


12366 


340 


442 


GLFVCFFVWFLETES CSVTQAGVHWCDL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














SSLQPS 


12253 


26154 


A 


12367 


208 


45 


EGGEFFFFFFSVFFFVRAGVLPCCPGWS 
ETPGSSDPPALASQSAGITGATVNST 


12254 


26155 


A 


12368 


60 


443 


CSS CVS SKPQS PHLKKQGDRMGAHL I S G 
G C T E AT AGKGMRKCL VAWS T CAGAR P P S 
RNRGS QGHS ARFQGPRLPT VVRLPP P P P 
QAKPFCKKWMLGGPIS IEPKPQHPPGPG 
KKKWAG KAAP RLiVP T 


12255 


26156 


A 


12370 


341 


421 


SGWVRWLTPVTPALWEAEAGGSPEVGR 


12256 


26157 


A 


12371 


280 


393 


NTIDHYVTQWLTPVIRTLWEAEAGRSPE 
VRSSRSAWPT 


12257 


26158 


A 


12372 


268 


399 


TLCSLSSAESIIHSYPEFSCKDNYLMTH 
NEAYKACMRRAFHI PL 


12258 


26159 


A 


12373 


209 


468 


EAVPDETSHPSPADSFSTSSNLACGTHH 
LRRLL KL YVE E E LE Y YPRK I Q CFHF S F F 
FFESRSLLLLPRLECSGSISAHCNLCLL 
GS 


12259 


26160 


A 


12374 


381 


488 


TITCLFQKCQRGRARWLTPVI palweae 
AGGSQGQE 


12260 


26161 


A 


12375 


276 


395 


GHWKTKKLKL I F VFFEMES CSVAQAGV 
QWHDLGSLQPPP 


12261 


26162 


A 


12376 


295 


472 


VYS S VALNTFTLLCNHHHQP S PEVLTFP 
NKKCRGAILAHCNIiHLLGS SDS PTS ASR 
VAG 


12262 


26163 


A 


12377 


390 


511 


PGAVAHACNP S TLGGRGGQ I TRI PS L 


12263 


26164 


A 


12378 


417 


511 


AHRS FFEMES CS VTQAGVQWRRLGS LQP 
SPP 


12264 


26165 


A 


12379 


133 


406 


KL S LNKRD YKKQ VCKE T KKKKKKKKKKK 
KKKKKKP GAQ KKKS GGS PR 


12265 


26166 


A 


12380 


247 


345 


HMW S S QLNKGL I NKKKKKKKKKKKKKKK 
KKKKKKKKG 


12266 


26167 


A 


12381 


156 


15 


NFFFFETESCSSVTQTRVQWFDLGSLQP 
GKQEQNYVS KNKQKEL IH 


12267 


26168 


A 


12382 


275 


1 


KIVFLKGPPLFFFFKKKFVFFLKVGFKG 
GLNF I KTFP PGGKP I SKKRQ I FFFFFFF 
ETEPHSVARLECSGTISPHCNLHLPSSN 
DSULSTS 


12268 


26169 


A 


12383 


43 


379 


LLGYDESRSLSWI CLCLSCLGQLLSFLN 
LLVYVFCQTCEVCFHYFFQYFSTLFLLT 
FWDSNVLNIRHFG IVPQVLDDIiVFFS PN 
KE I FLGQLQWLMP VI PAFWE PEAGRS PE 


12269 


26170 


A 


12384 


303 


1 


KGPFFFFGLSPFFFLKKVFFFPPFFLRT 
PPFFFFPPFFKNFFPPPPKKKNPFFFFP 
PPPFFFFFFFFFETRSRSVAQARMQWGD 
HGSPQPQPPGSSDPTRP 


12270 


26171 


A 


12385 


141 


1 


KKEPLRAPPPFQLFCFFYFETTPCSVTQ 
ARVQWCNLSSLQPPPLRYK 


12271 


26172 


A 


12386 


31 


417 


CRLADSPSPNDTGQDSRGRAGIKHIPPL 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKSGGGA 


12272 


26173 


A 


12387 


291 


412 


LS S EMLLFCFVYFLRWS FTFVAQAGVQ W 
CDLGLLQ P PL PG 


12273 


26174 


A 


12388 


310 


1 


MTPCPFLFFCPKKRKAGGGFIKKALFFN 
PQKEVFLGPPPKGPFFLGFQPTPPWGKK 
IPRLNPLFWRSSIFFFFLLDGVSRL.LPR 
LECNGVI SAHCNLRLLGSS 


12274 


26175 | A 


12389 


117 


2 


KRNPQRGGPLFFFFFFFETESRSWQAG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso!eucine, 
K=L,ysine, L=OLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQWHDLGSLQ 


12275 


26176 


A 


12390 


117 


319 


SFLKKPKDSTKNLLKLINEFSKGSGPKI 
TIQKSVAFIYPNGEHSEKKIREVITFTI 
AAKTIKCPPQA 


12276 


26177 


A 


12391 


95 


2 


SKSWTSLKLRTLGWARWLTPVI PALWEA 
KAG 


12277 


26178 


A 


12393 


127 


395 


I PGS QGFCL I KQS ALHLLNKS S FS FLS F 
IETQSLSVTQAGVQWCYLSSLQPPPPRF 
KLRQ KNRLNS GGRG CN KLRAHH YT P GWV 
TERDS 


12278 


26179 


A 


12394 


252 


391 


FFCFCFVLLCFFEMESCSVAQAGLQWPG 
LGSLRQGFAMLARLVSSS 


12279 


26180 


A 


12395 


275 


3 


TFRLV I LKKS KETVE INCLETKSHS VAR 
AGVQWCSQDWWEEKAGFlSrWRPSKARRCP 
TSVLKCHLGWAQWIjMPVI PALWEAKAGG 
SPEVRS 


12280 


26181 


A 


12396 


111 


3 


GNNVKIRGLRI RPLKGWACWFTPVI PAF 
WEAEAGG 


12281 


26182 


A 


12397 


124 


1 


VSRGRTVHQEKPTERAAS PRLECNGT I S 
AHHNLCLPGSSNS 


12282 


26183 


A 


12398 


259 


387 


YQQESFSCLFIFIFLFFIFYDTESRSVA 
QAGMQ WHDLGS LKP P 


12283 


26184 


A 


12399 


103 


1 


KGRFFFFFFYETESRSVAQAGVQWRDLD 
SLQAPP 


12284 


26185 


A 


12400 


308 


416 


CGNNFEKAGRVRWLTPVI PALWEAKAGG 
SLEVRSDR 


12285 


26186 


A 


12401 


92 


1 


HTWGLTQWLMPVI PALQEAKVGKLFEPR 
SS 


12286 


26187 


A 


12402 


265 


388 


GLS WLFRRPGHLFYFI FFETMS VS ITQA 
GVGCCDLGSLQPP 


12287 


26188 


A 


12403 


279 


1 


SLSSKMESGSLTFSTIDIWGWIXTLSCC 
PGHCRMFSS I RGL YPLDASS TTS SS CNN" 
QNSPDMGTCSLWSQLLRRLRWEDCLNTG 
GRGCSEPGS 


12288 


26189 


A 


12404 


387 


152 


NPPPGFNFGGPLKKNFFSPPRGEKFVFL 
KRPPPFFFFFFFFFETESHSVAQAGVHW 
RDLGS LQALKS FCDLVSAFKGF 


12289 


26190 


A 


12405 


262 


464 


1 1 FHWIiACILVIHS S FDGHGLFLPLAVA 
DSAAVKHSCTS I CLNIE YI PGNGI AGS C 
GISVLNFLRNC 


12290 


26I9I 


A 


12406 


204 


67 


KCFGQAWWLMP VI PADWEAEAGRSQELR 
SSRPAWAKELLNNRTRAS 


12291 


26192 


A 


12407 


134 


1 


IQRPCIQLFSTFFLILFYFIFTEIDSCS 
VAQAE VQWHD FGS LQ P 


12292 


26193 


A 


12408 


145 


403 


TYMYSLINYYKANSHVTKVNKHNTARFL 
ESLLCAHPNPPIKLIISLLLREKKKKKK 
KKKKKKKKKKKKKQK 


12293 


26194 


A 


12409 


65 


415 


RNKRI ISQSWNGPSRKPQSSALLCGLGH 
LTSGVRARVSPVQGCLVRIKGGSGPSKP 
KKKKKKKKKKKKKKKKKS S KKKKAQKGG 
ALKKK 


12294 


26195 


A 


12410 


124 


404 


PWWEFLHHRNCQMWPGTVAHTCNPGTLG 
KKKKKKKKKKKKKKKKKKKSSSSSS 


12295 


26196 


A 


12411 


164 


2 


TGAGLTIMDMVGVGPDLLQLRFCFVLFC 
FLVF FFE TES D S VAQAGVQWCNP S S L 


12296 


26197 


A 


12412 


114 


3 


PGMVAHACNP S TLGGQGGW I TRGQVYKT 
GLAKTVKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12297 


26198 


A 


12413 


583 


391 


LADTGFHHVGQADLKPLTSCSTRLSLPK 
CWDYRCE P PHLAVP DTVLS TRGL ISEKS 
FLPLCTFR 


12298 


26199 


A 


12414 


280 


2 


KVSSMNAQTS VL FS TMLG I S ETLKYRGR 
KKRRERGREGGREGRSRKDECRKVRSER 
I EGGKKRGRKE YRE EGRRR I EVSANE E T 
PRDHPANK 


12299 


26200 


A 


12415 


264 


352 


RFLVHGCD I Y I C IN I YTHTHTHTHSHTH 
T 


12300 


26201 


A 


12416 


359 


485 


TDARMDWCYQEGTGSQAWWLTPVI PAFW 
EAEAGGS FEVRS LR 


12301 


26202 


A 


12417 


171 


1 


PLFRWWGFELFPPFGYQKKKGYPPFFFF 
E TE P C C VTQ AGMQW YD LRS LQPPPPEFK 
R 


12302 


26203 


A 


12418 


265 


1 


WHKS NE E LAAE AS AP VKARAS NT I LG PH 
THGKERCPVSLRNVLDEGVKSTNFFSFF 
ETDSHSVTQAECSGAISAHCNLHLjMGSR 
DSGA 


12303 


26204 


A 


12419 


278 


388 


SSLTGFFIRRRNEEPGQHGETPCLLjKIQ 

klagcggvl 


12304 


26205 


A 


12420 


205 


413 


WNLFVYLYQTHKYARVYSYNI ILFTNKK 
RAIGQAE WFTP 1 1 PAVWEAKVSRLPEVR 
SVRLSLPKWRITS 


12305 


26206 


A 


12421 


103 


2 


RSLSKHRTETILGLPAAVLI ILFPPLLI 
PTSKY 


12306 


26207 


A 


12422 


372 


478 


NI FVFL KE TPCQSQWLiMSVIP TLW E AE A 
GGSLEPR 


12307 


26208 


A 


12423 


174 


416 


KMHYCVLS AFL I LHLDTVAL I LS TCSTL 
DMDQFMRKRIEAIRGQILSKLKLTSPPK 
NYSEPEEVPPEVIS IYNSTRDLLQE 


12308 


26209 


A 


12424 


572 


733 


RFSCLSLPGSWDYKNCLNSGSRGYSEPR 
SHHC I PAWVTEQDS VSKQNKTKQS A 


12309 


26210 


A 


12425 


123 


2 


GYI FI F I YFFETGS CS VAQAGVQWRNLG 
SLQLPPPSSSDS 


12310 


26211 


A 


12426 


224 


417 


TADFYGVNYISIKLLPKKKKKRGGPFKE 
SKFTAAGLQKNIFFLSAPNFIALAWLK 
RRDWETPG 


12311 


26212 


A 


12427 


115 


3 


FFETGSHPVTQTGMQWCDCNSLQPLTNR 
LKQSSHLSIi 


12312 


26213 


A 


12428 


250 


398 


PWAGHLTSLGLSLP I FN I G I AGQAC WLM 
P VI PALWE AE AGRS LE VRS S K 


12313 


26214 


A 


12429 


215 


3 


KKGLWWGANKLWVFTPQKFFLNQ FS GW 
ALTPRGKSWFSKRRAPAPFFFFFFWETE 
SHSVAEAGMQWHDL 


12314 


26215 


A 


12430 


345 


2 


FSHHPAKPQGGLFYRQPLGLEGKETRPR 
LNDRFMAGPS FPAQQGLRMPFLGKEWER 
EGCSFVQCPVEGFAMWPYPLGSPPGPRL 
FFFFFFEAGSPSVTQAECSGS ITANCSL 
EL 


12315 


26216 


A 


12431 


348 


3 


LYS SLGDRVKLRLE KTKTNKWCLHS YI S 
TC KI INLDHYL I P CTKTNSEW I KYLNVR 
TNSIKLLEEDISVNLHELGSDNSFIiAMT 
P KAE 1 1 KE K I YKLH F I KI KI FH VS ED 1 1 
EK 


12316 


26217 


A 


12432 


160 


52 


NTAIGHPHQAI YLS YLS I I YLSIYLS I Y 
LSIYLSIYL 


12317 


26218 


A 


12433 


180 


2 


GRSKLHICREHSICAIEHVCGCDTENNT 
NLCQVQWLMPVILAIWEAEAWRSPEVRS 
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peptide 
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last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LRA 


12318 


26219 


A 


12434 


193 


3 


PGAENPPPKGGPPFFFPFPRGFPPGVFF 
L KKKG FPFFFFFFFDTES RS VAQAGVQW 
HDLGSLQ 


12319 


26220 


A 


12435 


321 


422 


NVEGDFLQRIKNIRYWPGAVAHTCGPST 
LGGRGG 


12320 


26221 


A 


12436 


116 


2 


HDRFCLFVYDRVSTIiS PRLECS SAI SAH 
CNLRLPGSSN 


12321 


26222 


A 


12437 


302 


400 


THTHTHTHTHTHTHSGLLYYLPKGKQSH 
EFCW 


12322 


26223 


A 


12438 


329 


399 


LALQARLECNGI ISAHCSLCLPY 


12323 


26224 


A 


12439 


310 


2 


KPILSRGLFCARPCFRRSNMRSFNPSQS 
YQVGKDTFGFTVKVTKLKVLGPSRGPKG 
ARQGVHKGKCLLTLPKMLRFLFFFETES 
CSVTRLECSGAILAHCNL 


12324 


26225 


A 


12440 


151 


401 


EGM P HL P GS HS S S N I LKRGRE S GRE AGR 
KGRKERRKEGRRERRKEEEKRGRGKIRR 
R KDRGENEE E ENNRL S S KVS LE I S TC P 


12325 


26226 


A 


12441 


276 


1 


FHPFGRPSVSPRVQPKIGI I IHTCVARI 
EKGGSCKALSLCPQYVGSNFFFFFETRS 
FSVAQAGVQRCDLGSMQPPHFPGSSDPP 
APASRVAG 


12326 


26227 


A 


12442 


140 


1 


NSHFPKNLGSPSSFFFFFYLETESRSVA 
QAGVQWLDLRSLQPPPPG 


12327 


26228 


A 


12443 


310 


3 


L P P C S WGL YGAFAR C P LMDD KQ L S Q VP L 
WACPLS TAGRTRLC VAC I KAAGKAQGF F 
FFFETESHSVTQAGVQWHDLGSLQPPTH 
AKPKHTHTHTQTHT I PDT 


12328 


26229 


A 


12444 


337 


2 


GFFAIVPKKGQKKMFSPFPKEGIFSGPC 
FISSTPTLLGVKKILPPNTSEIRPVLSS 
IINTSVHIREQKKTTYVGIFTGKSRFFF 
FETDSRS I AQVGVRWRHLGS LQAP PPG 


12329 


26230 


A 


12445 


112 


9 


GRVRWLTPAI PALWEAKAGRS PEVRNSR 
PAWPTR 


12330 


26231 


A 


12446 


172 


3 


I PGELL I GRAKFKAKNWPWAPRG I PLGR 
Q KF SFFFFFETE S RS VAQAGMQ WLNLG S 


12331 


26232 


A 


12447 


221 


418 


RKAERLE VFFRRLCQRALRP FPHCLAAA 
PMPL I VL KKP I LGRAWWLT P VI PAL WE A 
EAGGWLESRS 


12332 


26233 


A 


12448 


280 


392 


PLGRVRWLTTVI PAHWEALAGGS PEFRN 
LRPAWPDMA 


12333 


26234 


A 


12449 


410 


129 


PSQEIFPYPGPPGKTPFPLKKKKKNRGG 
GGGLPFPP P QKVKKKKF FY P GRGR FKQ P 
KFNP L P P PRGEKKNP KP P P P KKKKKKE K 
KRKEKKLVT 


12334 


26235 


A 


12450 


159 


1 


KKKRGGFLGLTFIKKTGPDPFFFFFETE 
SRSVAQAGVQWYDLGSLQAPPPGLM 


12335 


26236 


A 


12451 


270 


1 


NPLKFFFFSLFQNPLKKDPAI FNPFFFF 
FPRIFWPFFGGVSKAGIWFFFFPFFFFF 
FFFFFFETES RS VAQAGVQ WRDLG S LRT 
RGRTRG 


12336 


26237 


A 


12452 


249 


33 


D KL VKL P L I LKKLNS FFFFFETES RS VA 
Q TGVQ WRNLGSLHP P P P GF KPNAWAYAW 
AHAWWP GL PAQNLR 


12337 


26238 


A 


12453 


190 


86 


TNDLGWTQWLTPVIPALWDAEAGRSLEL 
RSTRKA 


12338 


26239 


A 


12454 


318 


2 


PPTFPLKTLFFPVFLPKLFSTLFSPKKK 
FSNFLVSAPPFFFLTLGNSQGVLLKGGP 
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F F F F F FFF E T E S R CVS Q VG VQ WRDLE KK 
E KKKCNSNGGRH Y I STHLKYK 


12339 


26240 


A 


12455 


204 


288 


IQLWTHTHTHTHTHTHTHTHTNETGKMW 
I 


12340 


26241 


A 


12456 


409 


13 


QGGAP P I P I MLS FLQE AGLLH VAQPGVK 
LLTSGDPPSPPSKSAWRQRQETRLNLGG 
RGR SEQRPPPRP P AWATQQD S VS KTNKQ 
TKNNKKIRKSIiCKKTLEKEMLAVMI ITK 
YLLCTDVPSTVLCTHLYYLT 


12341 


26242 


A 


12457 


204 


3 


KKKS PTPQSAKFPLGGGVFFFFLVGMRS 
PFVAQAGFKPLGSSDPPILASQSTAXTG 
MNHHTSSAFH 


12342 


26243 


A 


12458 


289 


3 


KAREKKKVGATKNI RI VDLS KGRTS LCK 
KTEMPPVGNRNKQKPQNPSNLRISTRPE 
TESRSVTRLECSGAISAQVIIiPGSSDSP 
ASNWRQSETLS 


12343 


26244 


A 


12459 


261 


388 


S WH FGRS KQ V Y CL S LGVRNQ P GQNP VS T 
KTTKI SQAWWHAPW 


12344 


26245 


A 


12460 


313 


391 


RVGRVHWLKPVI PALWE AVAGGSPEV 


12345 


26246 


A 


12461 


112 


7 


HICIYFFETESRSVAQAGVQWHDLSSLQ 
PPPAGFK 


12346 


26247 


A 


12462 


389 


3 


ALHEGAARGS P P P GGKKKKRG S P FP MGN 
QAPQVPGKLGGGFGFWFFKKSLPLQPGG 
KP GVF YKP KKVP T P KD P P P P TL WGGWGT 
TGGPPGPLFFFFFFFETESRSVTQAGVQ 
WRHLGP LQ P Q P P AHAS 


12347 


26248 


A 


12463 


26 


349 


C IM IDS TTG I F I HCWWECEMAHP FWKAV 
WQFLFFFFKRGVYFFFPGGIKGPGFGLR 
EPLPPGIKGVPRPNPLGGGDLRPPPPPP 
INLGFFKGKGVFMVWPGVGNFLI 


12348 


26249 


A 


12465 


187 


54 


GYKNKPGQAQWLTPVIPALWEAKAGGSP 
EVRSSRPAWPIWRTRG 


12349 


26250 


A 


12466 


199 


55 


1 1 QLRQQNS FL F I C FD I E S CS VAQAG VQ 
WHNLS SLQLE VPGFKHAPSH 


12350 


26251 


A 


12467 


107 


3 


FAHCLIGNSFFSFFFFFFEVGSCSVAQA 
GVQWSD 


12351 


26252 


A 


12468 


120 


1 


GNWCRAQWLTLVI PALWEAE VGRSPE VG 
SSTRLGLPTHRP 


12352 


26253 


A 


12469 


105 


2 


YYLYFI FFLFFLETGSCSVTQAGVQWKD 
LRSVTS 


12353 


26254 


A 


12470 


238 


405 


FH I CL F I L KNQ VS KM YTHT VS C S LLE 1 1 
YCCLLQWLTPVI PALWE TKVGGSLEVRS 


12354 


26255 


A 


12471 


425 


1 


LNPAPAI F FGG P KKKNF F S P PRGDKF F F 
FKRAPPFFFFFFFFFFFFQKKKKPFFPA 
QKKGRGFPGRALSQWPLAGFLKITQFPP 
WSRVPFSPLLFFFFDME S P SVAQAGVLW 
HDLLHL PGS SDP PTPAS RVAGI TGVCHH 
A 


12355 


26256 


A 


12472 


122 


2 


QNKYSELRMNTFGRPQWFTPVI PALWEA 
EAGGSPEVRSSR 


12356 


26257 


A 


12473 


144 


420 


GAPRP I SRPRGKTPGGGGKRGKPNKNGL 
GENPFFPKGVKGKNPPIFFPFLGQNGPT 
PLVLGSANMGFYLYLKRKGPGKGREALR 
AEPGQPFY 


12357 


26258 


A 


12474 


80 


1 


KGG QVQWLT P VI PAFWEAEAGGSPEV 


12358 


26259 


A 


12475 


54 


294 


LIKSWQHRQEYVSLMSFIFFPCHIWYQS 
QRSPGTTQI ILE I SDTGKNKYFNCTGSH 
RKHTHTHTHTHTHTHTHAHTHTL 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine ? H=Histidine, I=IsoIeucine, 
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12359 


26260 


A 


12476 


364 


234 


E EMG FHHVS QDD LNLLNS W S ACLGL P KC 
WDYRSETSRLASLPF 


12360 


26261 


A 


12477 


288 


450 


PGTVAHACNPSTLGGPGWSRTPGIiKRI I 
CLGLPKCWDFRRELLPSLPCLLI FL 


12361 


26262 


A 


12478 


271 


396 


TISFLSFFFLFLFFFFFFWRKSGRESRF 
SQKKKKYKKKNIYY 


12362 


26263 


A 


12479 


289 


2 


GSCLLERKLTYRKD IHTK3STPSVPHHHQR 
PKVDKTPKTGKKQGRKTGNSKNQSASPP 
PKECS S S PATEQKWMENDFAKLREKGFR 
RSNYSELQEEI 


12363 


26264 


A 


12480 


323 


1 


YSMVYPYVFLTAKKDQLQVNNTQLTCKS 
YQLYHCINHSTLQTHNISTLMILGHIPG 
LCIPVNLSEAWAATPALHFVKLIiLTELT 
HH VCRALD III LA I VS L VAL ITS 


12364 


26265 


A 


12481 


279 


429 


MNGQRKCVYVYTL YT YTTE Y YS TLKRRK 
WPGTMAKACKPNTLGGHGGWTA 


12365 


26266 


A * 


12482 


308 


423 


LTFFKNITGFFFFFFETEFCFVPQAGAQ 
GGDLS SLKAP 


12366 


26267 


A 


12483 


272 


474 


MVS WGL S RAK I P W VL S F FF FFE KKS HF F 
FQGGGRGGDLSWEQPLPPRSKGFSCFTL 
PSTRDYRPAGP 


12367 


26268 


A 


12484 


84 


229 


LFKSVCHLLPLSSCSSHVRQACLPFTFC 
HDCKCSEASPAMLPVQPAEL 


12368 


26269 


A 


12485 


392 


3 


TG KKKRF P F F P KKK I FN S KPQ F S WRKKK 
LFKIFSPPGGAPFPPGVQKKKTFFFFYP 
FSPRTPFFS I FFPGKGPPKGGGLGPPFF 
PQKRGGGFPNSKWETWEKFPPLFFFFFF 
ETE SHS VAQAGVQWHDL 




26270 


A 


12486 


76 


1 


KLI SWTWWHMPWPATWRAEAGDFS 


12370 


26271 


A 


12487 


204 


8 


FFI FFFFFFFFFFFCFKTGSHS VAQAEQ 
EWHDHGSLQPQPPQSFHLSLPCATMPGL 
FFLFLFLFT 


12371 


26272 


A 


12488 


96 


3 


KKPLGQAMFFFFFETESCSVAQAGVQWC 
DL 


12372 


26273 


A 


12489 


126 


1 


KP PHP FLPFFFF FETE SHS VAQAGVQWC 
YLGSLQAPPPGFTP 


123 /3 


26274 


A 


12490 


107 


2 


IAGRAWWLMLAI PTLWEAEAGGSLEPRS 
SQPTCAT 


12374 


26275 


A 


12491 


302 


2 


GGFGFFPPREKGGFFQTVLFGVPPGFFS 
P P VFKTGPGVFFLGAQKKKI FFP PPGGK 
IWFFLRGAPLFFFFFFFFFFFEMTSCSV 
AQAGLQWCDLGSLQHP 


12375 


26276 


A 


12492 


248 


398 


PTEQVTLG I TAQS YSRVHINNRVYDLAV 
GSGHPDGAAAI KGS FVQRLKS Y 


12376 


26277 


A 


12493 


341 


1 


KTSHFREIYPLEHASSLSKKIETGALPC 
SQELLMSQENSVWKFPCLNQLPFQDVN 
SCI IRFQTKDTLKHTAIQPEAKTLSLPC 
YTHTHTHTHTHTNNHS I FELLCMQCDS Y 
N 


12377 


26278 


A 


12494 


95 


3 


PL FFFE TKS CS VAQAG VQ WQNLGS LQ P P 


12378 


26279 


A 


12495 


62 


420 


CAAKLCTEAHTAASADTHTS PHGS VSGL 
FCFHFPPHRREQRS RPGRKPGAHRLAGR 
ALSQKPVGSGATPHNLHHQ IRTQTNGL I 
QLLGDRQAPWVTPVLPVLWEAEAGGSLE 
ARSLRPA 


12379 


26280 


A 


12496 


58 


492 


NSPPPPAPSQRTSPPAAAASPTTGSSSA 
PCPASSSWPRSSIALITFYPDQPCGLSF 
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NNLAWKQNVLCFKERLLEKMFRRIQVLG 
LLVWTL I AGTE YFRVPAFGWVMFVAVFY 
WVLT VF FLIIYI TMTYTR I PQVPWTTVC 
LCFKR 


12380 


26281 


A 


12497 


2 


198 


RLQGILS AGFLVPGDTSVS S PRPRAAGS 
QSSTSRVLWKLSGAMLRLYVLVMGVSAF 
TLQPAAHTG 


12381 


26282 


A 


12498 


71 


346 


GMFMT S KE TF TH YQ P QGNS D P AHTATAP 
GGLSAKAPAMTPLMLDTSSRKLVAWDGT 
TDGAAVGILAVAADQTSTTLTFYKSGTF 
RYEDVLWP 


12382 


26283 


A 


12499 


122 


59 


HQIYIYIHTHTHTHIYIYIYI 


12383 


26284 


A 


12500 


220 


3 


P KKE SVRCWAMKW ILLILCQTS LiHWS L 
PQSSIiAATPCCLNLGTVDVLQGLGAVAH 
AWNPSTLGGRDGWITS 


12384 


26285 


A 


12501 


125 


1 


HVLVLSHFLFVLFLFFETESRSVTRPKR 
SSAVSAHRNIRLS 


12385 


26286 


A 


12502 


373 


476 


ILTRDGVSLLPRLECSGTISAHHNLCLL 
GSGDYP 


12386 


26287 


A 


12503 


440 


573 


S TW E DHL S LGNRG C S E PRLHL CT P T W VT 
E 


12387 


26288 


A 


12504 


146 


5 


HPHLQTMNGAP I PQELTPLLEKERDGLR 
CRGNRSPVPGIPPRWSRDL 


12388 


26289 


A 


12505 


140 


44 


AQGLTP VI PALWEDE VGRS PEVRS SRSD 
WPTR 


12389 


26290 


A 


12506 


247 


420 


LEISVTFSGSFGCLPKKHIHTHTHTHTH 
THTHTHTHTHT Y I RT I YTTLAR P L S ALH 
N"F 


12390 


26291 


A 


12507 


346 


473 


KAY 1 1 GLNCF YE T KS H S VAQAGVQ WRD L 
NSLYS 


12391 


26292 


A 


12508 


87 


414 


RTGVYRVGKDGRSRS PDLL I CPPLGLPK 
CWDYRREPPRPACLPL.LSHPSPPPHFSF 
LSPFLSFSLPCSPFLYIFFPPPLLPISL 
RWRLI PEGRFMAPLCLQMPLGHDT 


12392 


26293 


A 


12509 


386 


467 


LLLFFEMESHSITRLECSGAILAHCNL 


12393 


26294 


A 


12510 


344 


472 


KFLPFDPAVSLLGJYPKESKSFYQKDTC 
TRMFIAILFTIANTW 


12394 


26295 


A 


12511 


424 


132 


RGEGPPRSGILSPVGPPGETPPFLKKQK 
I TRGGGGGP LF P PLKRVRGENS FP PGGK 
S FHGAKF P F CP P P WAT KRNS VS KKKKKQ 
TKKGSRIILYNNE 


12395 


26296 


A 


12512 


351 


510 


GGTDFCKQS KLQS LQPAVE I F I PGRVQW 
LTPVI S ALWEAEAGGS PEVRRSRPA 


12396 


26297 


A 


12513 


362 


463 


NTITWLGAVAHACNPHTLGGPGGRITWG 
QEFETS 


12397 


26298 


A 


12514 


137 


1 


EE I SLLL PRLECNGAI TAHHNLHL P CS R 
NRRPGKAVHVRQAYS FQ 


12398 


26299 


A 


12515 


251 


592 


GAFTGLAFTMAGGRPHLKRS FS 1 1 PCFV 
FVESVLLG I VILLAYRLEFTDTFP VHTQ 
GFFCYDSTYAKPYPGPEAASRVPPALVY 
ALVTAGPTLTILLGELARAFFPAPPLAG 
PV 


12399 


26300 


A 


12516 


3 


413 


SWGGRKFLCPPRSLSGSRELHPAQGDRP 
GPLSSSGKRETGTHRETLGKKKVSSAPE 
AQGAGLRLSQALGGLCYDLSPLTEPRLP 
LAATAFPRPCPALPHPQPRVTMGSVSSL 
I S GHS FHS KHCRAS QNKLRKS SHLK 
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12400 


26301 


A 


12517 


115 


2 


KSHGGHGVEVLVSVLVGLAMEVGLVLWV 
KEYDGAALA 


12401 


26302 


A 


12519 


444 


34 


YCLRQKHTE YRRRSNLEMAS S CLSGRKT 
CTFFTLYQNLKM I KXSEEGTWQAEWLKS 
RPLAPNSKWEAKEKFLKKIKSVTSLNT 
YILRKPNSLIADMENILVIHIEYQTSYN 
IPSSQRLIQSKALCLFNYLEMECRI 


12402 


26303 


A 


12520 


1088 


935 


ILIIFINYRCEFLRSKKSSEEITQYIQS 
YKGFVDRTVMYNSNFFLKFLDYL 


12403 


26304 


A 


12521 


2 


347 


AMAAGSRTSLLLAFALLCLPWLQEAGAV 
QTVPIjSRLFDHAMLQAHRAHQLiAIDTYQ 
EFEETYIPKDQKYSFLHDSQTSFCFSDS 
IPTPSNMEETQQKSNLELLRISLLLIES 
WLE 


12404 


26305 


A 


12522 


50 


205 


WGESDRQSSGFWASSAHACKPTWEVET 
GGSLEARSWRLQCTMFASVKSTRP 


12405 


26306 


A 


12523 


105 


2 


RFVCS TI KVLRDLSSDRSNPGRFLSTSN 
SSLYER 


12406 


26307 


A 


12524 


279 


455 


KTKNWLALGGPPCFPGQYGETPSLLKNE 
KQAGHGGAPLYSRLiLRRLRRGNSLTPGD 
RGC 


12407 


26308 


A 


12526 


305 


454 


LYFLLFPFQ I IMRMVLFFVFCFFFETES 
HSVPQAGVQWRDLGSLQAPPPG 


12408 


26309 


A 


12527 


363 


459 


CTILGQAWWLTPVILALWEAEAGRSPEV 
RGSR 


12409 


26310 


A 


12528 


182 


36 


QH I WFHKE HGQ VRWLT P VI S AL WKAE A 
GRSPEVRSSNAAWHACRWSA 


12410 


26311 


A 


12529 


388 


530 


KSFLNFFPRDGILNCVPQAGLELGSSDP 
PAS AFRVAGTT S VWHHS Q I 


12411 


26312 


A 


12530 


134 


1 


GH KKG VL F F F F KT E S RS VAQAG VQWCTL 
GSLQPPLPAHATRPRV 


12412 


26313 


A 


12531 


230 


1 


KKFGFFPQAGPQGGLHLFTTLAPWGQAI 
SQKREKPFFFFFFETEPHSVARLECSGT 
I S PHCNLHLP S SNDS QASTS 


12413 


26314 


A 


12532 


145 


3 


KKLE I LSAI IPPKFRKEI FLFFFYETES 
CSVAQAGVQWRDLGSLQAP 


12414 


26315 


A 


12533 


365 


2 


LFFFTSLIWQITLGFLILNCPCIPGVCI 
YIQFANTFRKFAFLFTNEIAYNILILPL 
TKQDYTNLI KLI KKHS FFSNFLKQLLFV 
FCFVFVS LFAME S CSVAQARVQWRNLGT 
LQPPPPRFK 


12415 


26316 


A 


12534 


313 


475 


FFWSCSSFQQGFGGGITI II I ILRWILA 
LSPRLERKWRDLGSLQPSPSGFKRFF 


12416 


26317 


A 


12535 


124 


3 


MGSLIMGAELSVYRQVCDCKPQGVCVCL 
CVCVCVCASTCM 


12417 


26318 


A 


12536 


333 


58 


ACNPSKVRSCRPAWSNMVKSCLSKNAKI 
TKEWWWANFFFVFVFLVEMMFHHVGQLS 
LKLLTSSDLAASASQSYGITGVNHYAQP 
ARTRGSFR 


12418 


26319 


A 


12537 


214 


3 


SD PRRACRC P KEAPEQQRRLPGGS VP AL 
SIFFFFKTESCSVPLSPRLECSGVISAH 
CKLRLPGSCVPPAS 


12419 


26320 


A 


12538 


146 


1 


MNFLAFTNP PRP PQQ I LKPLLF I LRRES 
HSVAQAGVQWCDLSSLLPPA 


12420 


26321 


A 


12539 


195 


1 


IFIPPQAQKRGDPFLSFFFFETRSPSPR 
LECSDAITAHCSLHLPGPGEPPTPPIPK 
SWDHSHVPP 


12421 


26322 


A 


12540 


151 


334 


LLGRLRHKNHLNPGGGGCSGTIMALCNI 
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LLPRSSSSPT S AS KVAG I TGACHHAC CL 
RYFLY 


12422 


26323 


A 


12541 


314 


402 


NLNSGPAWWLMPVI PALWEAEAGGS PE V 
G 


12423 


26324 


A 


12542 


339 


3 


GSCLLEGKLPNRKDIHTENPSVHHHHQR 
PKVDKPTKMGKKQNRKTGNSKTQSVSPP 
PKERSSSPATEQSWMENDFDELREGFRR 
SNYCELREDIQTKGKEVEIFEKNFERV 


12424 


26325 


A 


12543 


31 


365 


RSAVMPTEDRATWKFNYFLKI IQLLDDY 
P KC F I VGAND VG S KQMQQ I RM S LRGKAV 
ALKGNNTMMRKAIRGRLENNPALEKLLP 
HI RGNEGS VFTNEDLTE I RDMLLGNKV 


12425 


26326 


A 


12544 


278 


410 


FS FKNVTAGRVWWLMP VI PALWEAEVGG 
S PE VKRSRTTLANMVK 


12426 


26327 


A 


12545 


88 


276 


EVPQAHRKLPSAPQLRPLQAISTKGPCP 
PQSPQPSPRPEGSSAHSRGEKRTRERGK 
EEKRKKK 


12427 


26328 


A 


12546 


251 


3 


PVEDNSKRRFQTGEEKVTQEEQEALTPW 
RTNF F CF S D RVS VAQ ARVQ C CD S GS LQ P 
QLPGS SDL PTSASQVAGTTGMRHHAC 


12428 


26329 


A 


12547 


106 


2 


DEHLPFLTSFETESCSVAQAGVQWRDLG 
SLQAPP 


12429 


26330 


A 


12548 


225 


1 


GFFKKEPHPRPCKKKTPPKLKPFWGGLN 
KKIPLSFVFSKKGPLPKKFFFPFFFFET 
GS CS CHPAGVQWCDHGSLQ 


12430 


26331 


A 


12549 


218 


3 


VPPPGLVPGLPQIIPKGFPKTKGPPKGP 
PPGDKQKSGPPNKKNFFFFFLRRESHSV 
AQAGVPWCDLGSLQP 


12431 


26332 


A 


12550 


276 


24 


GPLKMVEAPPQNPLSPKNKIFFFFFFET 
E S HS VAQAG VQ WRNLRL P GS S D F PAS D S 
RG AG I TGARPHVQL I F V FIj VKTGRKRR 


12432 


26333 


A 


12551 


331 


440 


KSASSWS FNS KGWGW WLTP VI P ALWE AE 
VGGSLEVR 


12433 


26334 


A 


12552 


136 


1 


RHTGS KT TATALVD VFLLL FFETE SHS L 
SRLECSGTTPGHCNLYE 


12434 


26335 


A 


12553 


84 


1 


I FFCLFS FEME SHS VAQTGVQWCDHGSL 


12435 


26336 


A 


12554 


3 


349 


HASGPEELSKDPSLVSQGQPHRKPGLKR 
CSSRPLGPPDKLGGEGKQGLLGFTLWLS 

GPFKKKLFFPPGGGRNFFFLGAPKFFWA 
GRF 


12436 


26337 


A 


12555 


257 


417 


KQLHLLQGRLFSPSFLPSISKLFFFEME 
SRSVTQAGVQGCDLASLQAPPPGFT 


12437 


26338 


A 


12556 


268 


403 


YMRLSFQDLVMFKDVAVDFSQEEWECLN 
SYQRNLYRDVI LENYSN 


12438 


26339 


A 


12557 


286 


426 


DML I KT C CHL VCE KKKKKKKKKKKKKKK 
KKKKKKRGGAL KKKPLPPP 


12439 


26340 


A 


12558 


114 


1 


PLKEIANRHMKKCS SSLAIRQMQ I KTTM 
RYYFTPVTMA 


12440 


26341 


A 


12559 


106 


1 


FFFFFFFFFFFYFFFFFFEMESRSVAQA 
GVQWCDL 


12441 


26342 


A 


12560 


118 


2 


KN YSNNL KTNAGRAQ WLT P A I PALWEAE 
AGKSPEVRNW 


12442 


26343 


A 


12561 


196 


2 


MCCFSKNWQEGAQMTPGFEPPQDNSEKT 
KLSDTRLLFFFFERESRSVAQAGVQWRG 
LGSLQAPP 


12443 


26344 


A 


12562 


128 


3 


RAPPFFFFETEFRSI»PRLECNGAISAHR 
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NLRIPGSSDSPAS 


12444 


26345 


A 


12563 


277 


370 


WSFTLVPQAGAQWCDLGSLQPALSRFKR 
FSR 


12445 


26346 


A 


12564 


55 


251 


AMVIPTVPFNITINSKPLGHISFQLFAD 
KFPKTGENFHTLNNKDKGFGS CFHRI I P 
EFICQGDDF 


12446 


26347 


A 


12565 


129 


3 


KYSVLRPGTVAHTCNPNTLGGRGGQITW 
GWEFETSLANVAK 


12447 


26348 


A 


12566 


172 


1 


PPPKKKKPRPPKKKNFFFFFGAISAHCN 
LCVPGS SDSPAS PSRVAE I TGS RCHAE P 
Y 


12448 


26349 


A 


12567 


257 


380 


GICRPLLGGVLQSGATGVRDPLEEAVCP 
LAELKHCAGRSTA 


12449 


26350 


A 


12568 


120 


r 


GVFLFCFVLiFETESCSVARLECSGMISA 
HCNLRLLGSSNS 


12450 


26351 


A 


12569 


250 


2 


IGKPKTPQFWFLKTKRGGPNAAFSFKKF 
FFQLKPPPRVFLKI FSCQKKIFFFFFFE 
TESRSVPQARVQWHDLGSLQAPPPGF 


12451 


26352 


A 


12570 


338 


3 


VSTPEKNFVFHTRGFSKKSSKYKVCNFF 
FQKKLGSPPFSLKKVFPGKPPYSCCSGR 
VSTAPYFLNPGPSTKILPPFFGPLGPEE 
KFLS F F F F FEME CR S VI QAG VQ W CDLG 


12452 


26353 


A 


12571 


5 


556 


ICCCLCFKINIFQLHFNKINFFCVTRSG 
P VTQAGVQWCNLGS QQPQPLS SKQS 


12453 


26354 


A 


12572 


115 


2 


DRVS LL VLAHCNL C LRGS S D S P AS ASR V 
AGITGARHD 


12454 


26355 


A 


12573 


279 


377 


GQVQWLMPWPAFWEDEAEGLLQPSSSR 
PAWAT 


12455 


26356 


A 


12574 


117 


1 


KTPLFFFFFFFEMESRSVAQAGVQWYDL 
GSLQAPPNGFT 


12456 


26357 


A 


12575 


253 


374. 


S LPGM WHACNPS P LGGRGRR I T WGRKF 
GTSLANMAKPCL 


12457 


26358 


A 


12576 


130 


2 


KFPGPFLGFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


12458 


26359 


A 


12577 


132 


3 


RVPGPPLiLFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


12459 


26360 


A 


12578 


3 


370 


LRKNCLNLGGRECSELRSHHCTPAWVTE 
RDSVSKKKTLSVLMEKWPNFPLRPLALS 
RKKFPRGPFEQVTHLVKEWSLTKPCCA 
EGADLDCYDPRTSPLFAKSWERNFPFPV 
HPGPVEGCPK 


12460 


26361 


A 


12579 


131 


2 


FIYFLFYFLFLETESLSVAQAGVQWHDV 
GSLQTPPPRFTPTRA 


12461 


26362 


A 


12580 


202 


2 


RMRKNQHKRAENS KNQNTS F P P KDHNS S 
QTREQNWMENEFDELTEVGFRRWVITNS 
SNLKEHVFSC 


12462 


26363 


A 


12581 


109 


2 


KI FFFFFFETESRS VPQARVQWHDLGSL 
QAPPPGF 


12463 


26364 


A 


12582 


105 


375 


LIMILLPLQILVHSGTFCFLFFGFETEF 
HSTPRLECNGVMSAHCNLCLPVKTSPAS 
TSQIS 


12464 


26365 


A 


12583 


106 


3 


RPIFFFFFFQTESRTVARAGEQWCDLGS 
IQPPPP 


12465 


26366 


A 


12584 


484 


700 


S S S WNRAF S RKKD KT WMHTP E AL S KHF I 
P YNAKFLG S TE VE Q P KGTE WRDAVRKL 
KFARP I KKSEGQKKKK 


12466 


26367 


A 


12585 


301 


1 


TQPKVRTQMKNNIFEHAFLSSTFPWPLP 
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NLPADFYSATSQTKPDILQSMQKAFKNS 
GIGQAWWLIPAI PAALWKAE VGGLLEPR 
NLRPAWVYGKTLS LQK 


12467 


26368 


A 


12586 


76 


379 


PLHPPASCPAPSLSCLIPGSPSDPLFSD 
SARPGWGRGWGRAPQLPPSPPFLYNLIK 
KWSRFCHQKKKKKKKKKKKKKKKKKKKK 
KGFFFWRAPPFFFFGGGFFS 


12468 


26369 


A 


12587 


109 


376 


LEFFKMSGWPTAPEQPAGEMENQTKPP 
DPRPDAPPEYNSHFLPGPPGTAVPPPTG 
YPGGLPMGYYSPQQPSTFPLYQPVGGIH 
PVRYQ 


12469 


26370 


A 


12588 


323 


1 


KKFCPPRFFPPLFGFQKKGTNPISPFSG 
GFKKNLGGFNPPPRPPWSPQIGFFLGRS 
KELILGLFPTRIGIGGFFS FFFFCETKS 
HPVMRLE CSGVI SAHCNLRLLVS 


12470 


26371 


A 


12589 


262 


1 


GDPPTTSGPQTNQPKEHLMNFKSDSQLY 
EDTLAGRS VL I KNLT PQTLQPRWTGP YL 
VI YS TPTAVRLQD P PHWVHRSR I KLCP S 
EKK 


12471 


26372 


A 


12590 


284 


374 


TESRNWGWAQWLTPVILPALWEAEAGGS 
PE 


12472 


26373 


A 


12591 


297 


387 


GQAWWLTS VI PAFWE AKEGGS PDVRSSG 
LA 


12473 


26374 


A 


12592 


206 


407 


VKPQNPRKAQKLKGLVYIH I Y V Y I YVYV 
YIYIYIYMYICRYISLTWYTVNSKEKG 
LDTAAHTCNSR 


12474 


26375 


A 


12593 


222 


379 


LTTGS I MGNFS FL I LVYTRKVMGS VQWL 
TAVI PLLWE AE AGGS LGSRS S RPA 


12475 


26376 


A 


12595 


301 


189 


KKIQSQAQWLPPI I PALLEAEAGRSLEL 
KNSRSAWAS 


12476 


26377 


A 


12596 


209 


3 


S VKP L F F I N YQ S Q WL FLFFRGRL KRS G 
TISAHCNLHLPGSRFSCLGLPKCWDYRC 
APPTLKIDLTKD 


12477 


26378 


A 


12597 


306 


3 


NGKKGFYMAPEPPFFFHFFFFSPLFPTL 
FFFFIFPLPFCFALSFLVGSSSLSPRLL 
CPMQHYFCDAWNTFDALIWGS I VDIAI 
TEVNVSTWRLSLTVRV 


12478 


26379 


A 


12598 


1 


249 


NTGAQ CPLEFSI QELQLLFLGGGEDGVS 
LLLLRLECNGAISAHCNLRFPGSGDPPA 
SASRVAGITGACRDHTCEPPRPVPPSS 


12479 


26380 


A 


12599 


228 


404 


RLSSLSPVTEFGCLSPPDLILKCDPWPG 
T VTHT CNP S TLGGRGGW I TRGQ E F KTS P 
VNM 


12480 


26381 


A 


12600 


101 


580 


LSIiTKNCALLGEETMMEQEMTRLHRRVS 
EVEAVLSQKEVELKASETQRSPLEQDLA 
TYITECSSLKRSLEQARMEVSQEDDKAL 
QLLHD IREQSRKLQE I KEQEYQAQVEEM 
RLMMNQLEEDLVSARRRSDLYESELRES 
RLAAEEFKRKATE CQHKLLK 


12481 


26382 


A 


12601 


464 


3 


CAIQQTLYEHPMKSSRLGPTQLKIFTCE 
YCNKVFKFKHSLQAHLRIHTNEKPYKCP 
QCS YASAI KANLNVHLRKHTGEKFACDY 
CSFTCLSKGHLKVHIERVHKKIKQHCRF 
CKKKYSDVKNLIKHIRDAHDPQDMYCGR 
SRGSARMSRALPS 


12482 


26383 


A 


12602 


140 


1 


WIGSFLSSEEDGTWQKISGVQRSLSLY 
VYI HTHTHTHTHTHTHTV 


12483 


26384 


A 


12603 


112 


2 


ESLRPGAVAHTHNPSTLGGRDGWINKDK 
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EFKMCERV 


12484 


26385 


A 


12604 


139 


1 


KKRVKRGPPGGFKGLGGKFFFSPKTFFF 
FFFETESCS VAQAG VQ WR 


12485 


26386 


A 


12605 


204 


460 


SGQGNGE I CHPRGFCTLPKAI SSLL I AR 
VCLKHFNKGVASLSQTANGCFFEGFSET 
ESRS VIQAGGQGGDPGSLQPPPPGS CLS 
L 


12486 


26387 


A 


12606 


317 


1 


KRPLFKNIFQKPFFKNGAPRKKGPRCIF 
GPNQKNPLFFPAPLSPPFWETKKGAGFF 
FWGFPFPPTKI IGGQKVPFPPTFPPFFF 
FFETESLSVTSLECSGAISAH 


12487 


26388 


A 


12607 


248 


367 


TQ I LS SGS VQS SLHHP P I CF IHTHTHTH 
THRHKHPHS 


12488 


26389 


A 


12608 


2 


387 


QKQQQRAGRETSTCSL.RI ISAPTMATFV 
ELS TKAKMP I VGLGTWKS P LGKVKEAVK 
VAIDAGYRHIDCAYVYQNEHEVGEAIQE 
KI QE KAVKREDL F I VS KVQWC I WLAFNT 
IVPKSQWQSLQTPQRC 


12489 


26390 


A 


12609 


258 


23 


MWERG I IRPQTQTIjTGEPEGLDHGRS I S 
PNS VMNG I VPH I S I LT LN I NGLNAP L KR 
PGIAEWIRIYQTSMCCLQETGR 


12490 


26391 


A 


12610 


176 


433 


DALSSAWGTVLGTQIPSHFTEILMLSPP 
AWGSSSLTQTLFYVPSGAKRTGSYVLAR 
VGQKYKTLWLGAVAHACNPSS LGGRGRW 
IT 


12491 


26392 


A 


12611 


312 


410 


AHMYRTCSRG WVS WLTPGI PALWEAEAG 
GSLEP 


12492 


26393 


A 


12612 


3 


289 


VFEFLSRKLSYILRMFWTFKEWFWLERF 
WLPPTIKWSDLEDHDGLVFVKPSHLYVT 
IPYAFLLLI IRRVFEKFVASPLAKSFGI 
KETVRKVTPNT 


12493 


26394 


A 


12613 


294 


432 


FMKLFFFFFFFERRSCFVAQSGMQGGYN 
ASLQPLiPFGSSNFSGLPL 


12494 


26395 


A 


12614 


138 


375 


SPNATCGHADPLPLCCLHICQALKRFTR 
QTFFFEGKPYLGILLYAVFFFLRLESCS 
VAQAG VQQPNLS S LETPP P S A 


12495 


26396 


A 


12615 


232 


1 


GVPQRAKLRAGLC P PQAMDTFS TKS LAL 
QAQKKLLS KMAS KAWAVLVDDTSS EVL 
DELYRATREFTRSRKEAQKML 


12496 


26397 


A 


12616 


236 


419 


LEMRNGWVWFFPPVNPTLWETEGVRDQP 
GQHEETLSLKI I FKISRHGGMRLGSQLL 
RRLRQ 


12497 


26398 


A 


12617 


197 


1 


LCHESEGFLSRVSALLFTLLLLVLMESP 
PSVSQPGVQWPHFSSLQPLLPRFKQFCN 
TLPSGWDYK 


12498 


26399 


A 


12618 


132 


3 


NS FMFHLFTYFEMES CS VAQAG VQWRDP 
GPLQAPPERVGGRV 


12499 


26400 


A 


12619 


255 


2 


MNS LNS FTD I AVI PLHYNR I LPHFKI I L 
S YFLKS KFQVKGIGRVKWLTP VI PTLWE 
AEVGRSLEPRNCPAHQPGQYGKILLYKK 


12500 


26401 


A 


12620 


141 


2 


FHFTYIIISCVCVCVTESRFVARLECSG 
AISAHCKNHRPGSSNSPA 


12501 


26402 


A 


12621 


274 


421 


TYVMEVCCTYYFLAQVLSLAPFNYFI IR 
PDPRTDPRPDPRPDPRPEPRP 


12502 


26403 


A 


12622 


132 


2 


NQS I KKKI ELQGFQRSGGRGKGQVQWLT 
HI I PARWEAKVGRS P 


12503 


26404 


A 


12623 


390 


86 


IFP FF YQNTNGMAPGNQ I PQGFS SRFFF 
FLRQVLALWPSLEYTGE ITDQCRLKCS S 
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SLSLCSSWDNRCAHPPLVNILFYVETGS 
HYWHIGLKLLHSRYPG 


12504 


26405 


A 


12624 


297 


396 


I IRVRGRAQWLTP VI PALWEAKAGGS PG 
VRKSR 


12505 


26406 


A 


12625 


303 


381 


KFHLVPS INTMSGSQELQWMVQPHFL 


12506 


26407 


A 


12626 


318 


152 


GDKRRLVS KKKKNLWNFGR IQW VP P VI P 
ILLEGEAGGSPEVRSFRPTGQQSMTPF 


12507 


26408 


A 


12627 


48 


388 


QLGNGKVRLLYQRHSSSFFFFWGEGFPP 
GPQGGGGGPNLGPRESPPPGVKGTPPPS 
LLEGGEPPGPPPPPGNFWGFWKKGGLPL 
YPGGPPTSGPKGEAPPTPPKGGGGIKGL 
P 


12508 


26409 


A 


12628 


116 


3 


KVATNQAQWLMP VI PALWEAKAGGS SE V 
RSSRTADAW 


12509 


26410 


A 


12629 


227 


400 


VLFVLFFCKE WHVF I FL I LP VYS YKKKK 
KKKKKKKKKKKKKKNKKDSQKKD 


12510 


26411 


A 


12630 


182 


3 


TNWLFPPPKFGPVPLFFLPPFFFFFFET 
GSHTVAQAGVCSGAILAHCNLRLPGSSN 
TPA 


12511 


26412 


A 


12631 


134 


473 


ASTIMDLLFGRRKTPKELLRQNQRALNR 
AMRELDRERLKLETQEKKI IADIKKMAK 
QGQMDAVR I MAKDLVRTRRY VRKF VLMR 
AN I QAVS L K I QTL KSNNS MAQAM KGVT K 
A 


12512 


26413 


A 


12632 


100 


462 


QLLLCCCCCRQWTAFNVPAS PAPP PAWA 
HMAPSLLEPPSLLVTQICKLSAFSGPSI 
NAFLLS KKKKKKKKKKKKKDRGGAPLKK 
KK 


12513 


26414 


A 


12633 


313 


441 


CWELLKWLVCFLCHI IKTNNCWAQWLTP 
VI PTLWEAE VGGS LE 


12514 


26415 


A 


12634 


129 


400 


VSQQCWPTPPALYSISRQALAASPQGRP 
WDLQPTMPESPLLPPRAPAWPKPLRRAL 
PPAPRCLGPSTAQGLRSAGAWCGTGRHL 
HLRPQC 


12515 


26416 


A 


12635 


308 


399 


E IRGWTQWHVPVI PALWEAEAGGLPEVG 
SS 


12516 


26417 


A 


12636 


285 


390 


I FGVL INS FI YFETESRSWRAGVQWRD 
LGSLQPP 


12517 


26418 


A 


12637 


260 


2 


GQGP F F FGGRGS P KKKP PL KG F F LVGG S 
KTQFFFPRPKKGPFGVFFFFFSQTASRS 
VTRL E CS GM I LAH CNLRL P G S S D S PAS A 
SR 


12518 


26419 


A 


12638 


76 


3 


ALQVQWLMPVIPAIiWEAKAGGSLE 


12519 


26420 


A 


12639 


219 


484 


LGLQEPDLDPKP ILELPLAELAQQLQTE 
ELSLESILCSYLKQALKVHQEVNCLMIF 
LGECEEELLALKKLKKSERGLLYGVPMS 
LKDT 


12520 


26421 


A 


12640 


33 


462 


EGLS WGYREHNGP IHWKEFFP I ADSDQQ 
S P I E I KT KE VKYD S S LRP L S I KYDPS S A 
KI I SNSGHS FNVD F DDTENKS VLRGG P L 
TG S YRLRQ VHLHWG S ADDHGS QH I VNGV 
S YAAE LHWHWNS D KYP S F VEAAHE PDG 
LAG 


12521 


26422 


A 


12641 


2 


414 


SGPAAPATPMS IFPELYFNVDNGYLEGIi 
VRGLKAGVL I QAD YLNLEQGETLEDLKL 
HLQ S TD YGNFLANE AS P LT VS V I DDRL K 
EKMAVE FRHMRNHAYQPLAS FLDFIT YS 
YMIBNVILLI TGTLHQRS I AELVPK 
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nucleotide 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N—Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12522 


26423 


A 


12642 


161 


463 


KEKARGRPKPLLLLPITSATTAMGLTIS 
SLFSRLFGKKQMR I LMAGLDAAGKTTI L 
YKLKLGE I VTT I PT I VFNVET VE YNNI C 
FT VWDDGGQER I R P LWN 


12523 


26424 


A 


12643 


124 


2 


T VFGGQAQWLMP 1 1 PALWEAEAGRS PE V 
TSLRPAWPRTRG 


12524 


26425 


A 


12644 


258 


454 


NRPELCDYQGLREDSKRKGEINQTACQF 
LWPRQKMSSQTKFKKDKE I IAE YEAQI K 
EIRTQIiVEQ 


12525 


26426 


A 


12645 


636 


182 


APLERPSMIPPRCSGGQPSDLRSSPAQG 
TPRPALGDRKGTPRIAEPGGAGRAEAPP 
PRGSRQRFGPQRRQTNAHTKRVRTRLSA 
APARHSPAFIACRQVGRPRLSARRSGKK 
PHLTVWCPFSFQDPIQDTTLQI ILLSLL 
GGDTFSDFPCS 


12526 


26427 


A 


12646 


199 


414 


I YHTS RS LLDRKLRAQRYDELPH YGGMD 
GVGVPASMYGDPHAPGP I P P VHHLNHGP 
PFHAKQNYGAHAPHPN 


12527 


26428 


A 


12647 


130 


3 


DRG F LC VHTP L KKKKNTHTHTHTHTHTH 
TNTHTHSHTHTHTK 


12528 


26429 


A 


12648 


192 


443 


liLLGWAQESLGTLGENTASSHTAGLEWE 
SPLFACWFFWGFCLFVWLFVWFEPKSGS 
VLTLECSSVITAHCSLDLPGSRDPPASA 


12529 


26430 


A 


12649 


393 


179 


LHSRVQRSGCFCSCVSNPVTGGMHFTRS 
SPQSNQEADGQARWLTPVILAI>WEAKAG 
GLPELKS SRPAWATP 


12530 


26431 


A 


12650 


263 


2 


KAPLLVYKWE PNPGGGNFS PGMGTPNTF 
GVRFGGETKPLFFFFFETGSHSVIQAGV 
QLCNLDSLQPPPPSVKPSQYSWDHRKQS 
NTK 


12531 


26432 


A 


12651 


355 


111 


QDPCFIRLHLPLSFLSVFLECTKISLLD 
WRMLFPLLGDPPHI^AIiSSLVSP INTWNY 
SFQQLMMFRDVAVDFSQEEWECLDL 


12532 


26433 


A 


12652 


271 


3 


RQLKlSriLIFFSHKNMLSIKHIIYlSIYTLF 
LKSAF I FNLETDHLNCFPGS I SAKSYF I 
LFYFFETEPHSVAQAGVQWRDLSTLQPP 
PPRFT 


12533 


26434 


A 


12653 


333 


2 


PPSPGVFGNPPSPGDPLGKTFFFPRGPP 
PRFWGGGFFFILAPFPFGGLFPPFPLPP 
SQGPPFFPFFFFFFFFEMESRSVSQAGV 
QWRDLSSLQPPRARFSDSPLYFLEGR 


12534 


26435 


A 


12654 


310 


444 


TYLSNIFYFILFLVETESPSVAQDGVRW 
CGLGSLGPPPPGFGRFS 


12535 


26436 


A 


12655 


336 


411 


DQPGQHGETPSLLKIQKLVECGGAH 


12536 


26437 


A 


12656 


114 


2 


GIFFFFFGLESRSVAQAGVQLCDLNSLQ 
SPPPGFKQF 


12537 


26438 


A 


12657 


191 


1 


KNF I LT FS FRGAKNGD VFF P P P GYIQTG 
ENFFFFFFLKWSFTLVAQAGVQWCNLSS 
LQPPPPR 


12538 


26439 


A 


12658 


1 


445 


LRTGS E FSGRD S KGLiAAAE PTANFGLLL 
ASIEDQGAGGGGYCGSRDQVRRCLRANL 
LVLL TE AAG VAGVALGLGVS E AGGALAL 
GPERLSAFFFPGELLLRLLRMI ILPLVG 
CSLIGGANSLDPGALGRLGAWALL.FFLG 
TTLLASAL 


12539 


26440 


A 


12659 


2 


224 


ESTASRIAFEAWQPETLPKGLNYSGASP 
WLNAVLPLKKKKKKKKKKKKKKKKKKK 
KKKGGGVFKKKKI KGGGG 
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beginning 


ed end 
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od 
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E— Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ingto first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12540 


26441 


A • 


12660 


96 


1 


GLKDGRGQVQWLT PVI SAP WEAKVGGS P 
EVMS 


12541 


26442 


A 


12661 


1 


457 


CWVHLQKGCHFTLGRLHSGQAEALLTSQ 
TGRPGRGAPHIPDGAARQRRSSHPRRGG 
QAEALLTSQTGQPGRGAPHLPDGAAGQR 
RSSPPRRRAAGQRGSSPPRRWAAGQRLS 
S L PRQGGRAEALLT FQTGRPGRGAPH I P 
DEAAGQRGGAG 


12542 


26443 


A 


12662 


175 


351 


KG I F F F P P S WKAGGA I WFKGT FAP RGKG 
NPPPNPSGEGGPKKKKPGPQIPRGPPLG 
PPV 


12543 


26444 


A 


12663 


247 


401 


SSNQEPGCKGPCVLTNFYF ILGRAWWLM 
PVIPAIWEAGTGGSPEVRSWRPT 


12544 


26445 


A 


12664 


152 


251 


RENSCCFIKKKKKKKKKKKKKKKKKKKK 
KRGAP 


12545 


26446 


A 


12665 


2 


365 


APTRSRCLRHAS CTACRTVSTDTS SLRR 
AD PKGRS ALLAD I QQGTRLRKVTQ I NDR 
SAPQ IESS KGTNKE GGGS ANTRGASTP P 
TLGDL FAGG F P VLRP AS QRD VAGKE E F I 
LAPLWTVSP 


12546 


26447 


A 


12666 


179 


1 


LELSKSGSHWMRHFLSRSFYSLIS PQLN 
TTVWPTI ITPILLTLFLITQLKILiNTNY 
HLP 


12547 


26448 


A 


12667 


279 


35 


NQAPFKARFFTTFKNVFLKE IKKKRCWS 
GAPPIIPPPWGGGGGGPPWGGNLSPPGP 
PVSTPFFNGAQKKKKKKDRYGGACL 


12548 


26449 


A 


12668 


299 


399 


IS IKKQRKGPVQWLMPVIPALWEAKAVG 
SPEVR 


12549 


26450 


A 


12669 


1 


385 


KMQNRGFFFLTFLALLGLTSGAAKKKNK 
GKKAGPGSKSPNWPWGPCPPSSKDCGGV 
FRKGTWGAQTHGIRGRGPCNWKKEFEAN 
CKSKFKNWGAGDGGPGTKVRQGTLKKAP 
S I AKGQET I RVTKP CT 


12550 


26451 


A 


12670 


357 


1 


AGKIAKI CPVSSMQAPTCGFPVGGNDNQ 
GQAPDGQ FQ P PLQQNQTS S PDFSNENS P 
ATPPNEQGQGDAPPQLEDEEPAFPHTDL 
AKLDDM INRPRWWPVLPKGELE VLLKK 
GIDFSKK 


12551 


26452 


A 


12671 


107 


373 


TGNYTPLEDCAQEQMRLIAQVHLLETRV 
KQQQVKI KQLLQENEVQFLDKGDEMTW 
DLGS KRQ YAD CS E T FNDG YKLRG F Y KI K 
PLLNP 


12552 


26453 


A 


12672 


196 


45 


YLYFHGSREKRLAKKYYDKLFKECCIAD 
LS KYKENKVC FPH I YGKRFLFK 


12553 


26454 


A 


12673 


33 


448 


KEGAERAGAAAPVWS FLLDRRDS ARTRS 
GT S LGS ADMGDMKT PD FDDL LAAFD I PD 
IDANEAIHSGPEENDVPGGPGKPQPCVG 
SESEHTASASAGDGPGVPAHASDHGLPP 
PDISVLSVIVKNTGLSRAGWRPWVDV 


12554 


26455 


A 


12674 


250 


29 


KKQTNMWQS PYED YRI F YTISS SLYHLS 
IYPPIYLS I YLSI YLSLYLPTHLS I ITD 
YVSILEHELRPLWRIQP 


12555 


26456 


A 


12675 


377 


442 


SDRQWWCMPIVPATWEAEAGES 


12556 


26457 


A 


12676 


1 


553 


RIFPGRRFRVKLPSCPDPAMGTRLLFWV 
AFCLLGADHTGAGVSQS PSNKVTE KGKD 
VE LRCD PIS GHTAL Y W YRQS LGQGLE FL 
IYFQGNSAPDKSGLPSDRFSAERTGGSV 
STLTIQRTQQEDSAVYLCASSLATAWHS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














RLLPAHKPHPSLSLQLLDTLNRGFSLKL 
PPHGKQVDLDLSLSFG 


12557 


26458 


A 


12677 


232 


3 


GWSSDCQQRARVIEEKHSRQKGHIAENP 
SVHRHHQRPKVDKTTKMGKKQNRKTGNS 
KTQSASPPPKERSSSPATCI 


12558 


26459 


A 


12678 


134 


3 


IFIVEFGGKPFSCTSISLSQWLWCLFIG 
IGELLWGQVSTGTLY 


12559 


26460 


A 


12679 


212 


47 


NGTHPHGYTLHTHKHTHSCAHTHTHTHS 
LLLVRKS FLPQKNTT I S FRS CCKLGKP 


12560 


26461 


A 


12680 


224 


2 


MKMASFLAFLLLNFRVCLLLLQLLMPHS 
AQFSVLGPSGPILAMVGEDADLPCHLFP 
PMSAETMELKWVS SSQCI 


12561 


26462 


A 


12681 


35 


397 


KPLISQNKIHSVSSSQTGFFFFFFFFGE 
KKP P FAPRGGGKGP PP PLP PGGGNRAP P 
RGEKKGKGKPPPKNPGAFPPQKEKKTPG 
GGGG KKKT PAP P T P KE VD Q PLP RRE QRG 
PGRLPPRGA 


12562 


26463 


A 


12682 


188 


514 


HTTHLVYVLSMAAFFFFFFEKNFLFAPR 
VE KRGKD LGS hKL P P PGFRH F S GLT L QG 
SGNNGAPPPSPVI FLVFFEKRGFPLVGR 
EGLILPPLQPAPFCISFRGAINGPS 


12563 


26464 


A 


12683 


366 


3 


I S QTQTQWTDWVS LELQGCE LAHRDTVA 
IPQTRSFPFPLMISCMTLFSLLQMDRAF 
PPFPPQTPITIQGQISTSPATWPLTHLH 
STPGLSVEYSSNTHKSSLSPTSQWIRVD 
PMLASPTV 


12564 


26465 


A 


12684 


99 


1 


SS C I RFFLEQAYGQQS YGT YGQPTDVS Y 
TQACI 


12565 


26466 


A 


12685 


247 


31 


FLKE KRJSTQ F YL F I FQRWLDPNKPIRKQL 
KRGSPYSDNFRVKFFVSDPTKLQEEYTR 
WVYGY I FLEN I VKTL I 


12566 


26467 


A 


12686 


416 


253 


PS PMT I PVTGAPRDADLWS SHAKMLAQP 
LKDSDVEVRFLGSSQIFLCWRPSFNL 


12567 


26468 


A 


12687 


116 


3 


CLLGRMWWLMP VI PALWEAKAGRSLE VR 
SLRPARPMY 


12568 


26469 


A 


12688 


380 


1 


SPESSKIiGIiWSKHVSPLLCIGVCPPPVS 
MAELRQVPGGRETPQGELRPEWEDEVP 
RS P VAEEPGGGGS S SSEAKLS PREEEDL 
DPR I QE E LEHLNQ AS EE I NQ VE LQLDE A 
RTTYRRILQESASV 


12569 


26470 


A 


12689 


157 


488 


REFVSQGGSHLKAQVRLEALLLTIGTPP 
WAHLS ILHMTAMGQRENKREAASSRIQV 
LLNHSQAIKQESI ILFFEMESGVWPRLQ 
YSRMISASCTLFLSGSNNSPVSAPRI 


12570 


26471 


A 


12690 


514 


3 


PS I RAGLiLCGSAENATP FLCG I TMAAGP 
LYTYPENWRAFKAL I AAQ YSGAQVRVLS 
APPHFHFGQTNRTPEFLRKFPAGKVPAF 
EGDDGFCVFES IAIAYYVSNEGLRGSTP 
EAAAQWQWVSYADSDIVPPASTWVFST 
LGIMRHNKQATENAKEECMRPLRVSSLV 
GP 


12571 


26472 


A 


12691 


304 


4 


AESLRVAHERLDTRSTSSDIFNFPQTQS 
NLEMNSEILjESWANYQSSTSYSINTELS 
LFSKVNGKFSTEFQRMKTLQVKDQAITT 
RVQVRNLVYTVKINPL 


12572 


26473 


A 


12692 


221 


2 


VGPATRDLCFADE P WGRQKQKQWWWVL 
CLS PNRGLAGRPDTDHI T C AS AHMRTHT 
CMHTHRHAHAHVHTHCI 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12573 


26474 


A 


12693 


3 


205 


YMLCSILSTQEAFSICLGEKFIYLFTYL 
FRDGVSLLLPRLECNGVILAHCNLRLGD 
RARLHLSKKQK 


12574 


26475 


A 


12694 


233 


3 


IMGPAHLFKGNHWNGGWGLMYKAFFFLF 
FFKTGSHS VAQAGTI PVHCNLRLPGS SD 
S PAPAS RVAGNTNVPHHTLY 


12575 


26476 


A 


12695 


408 


3 


PAREMEKFRVCVRKRPLGMREVRRGE IN 
I ITVEDKETLLVHEKKEAVDLTQYILQH 
VFYFDEVFGEACTNQDVYMKTTHPLIQH 
1FNGGNATCFAYGQTGAGKTYTMIGTHE 
NPGL YALAAKD I FRQL E VS Q PV 


12576 


26477 


A 


12696 


123 


631 


REAVQGGKGSGGGFGSRDSRSSGAVSAA 
VGDMGDPGSEIIESVPPAGPEASESTTD 
ENEDDIQFVSEGPSRPVLEYIDLVCGDD 
ENPSAYYSDILFPKMPKRQGDFLHFLNV 
KKVKTDTENNE VS KNHCRLS KAKE PHFE 
YIEQPIIEEKPSLSSKKEIDNLGASDCW 
D 


12577 


26478 


A 


12697 


429 


629 


LALLYPLKWVGKPVNFMEEDILGPLPPP 
LNEEEEEAEEEEEEEEEEENPVHKIPDS 
HE I TLKHGTKT 


12578 


26479 


A 


12698 


158 


404 


L PLLS VERLVE I YAYT F I QDL FS ASNI P 
YS S SVFRIEMDQPRTHSGPTTASNPAP S 
S TNS S SAPSATNS KQERS S S SLSKPS 


12579 


26480 


A 


12699 


185 


3 


HRRP I S IPSS VIHPAMVRLGLQYSQGLV 
SGSNARCIALLRALQQVCPILLSLMIQP 
HLVY 


12580 


26481 


A 


12701 


181 


3 


TVWAS MFLS AALRARAAGLAAHWGTHVR 
HLHKPDMQNGAGGALFVHRDPRENNRDT 
PCI 


12581 


26482 


A 


12702 


373 


1 


I L VQE RD S QRV I RM I D KL VS S RDRVGRG 
VE KQMTDHR V I Y S VRKLQ KR ITPEPFIP 
SRTTPKDLFLLVLPSGKTPGHPVSSRTI 
PEPPLPTEPLERIPEHPVPSGTIPKPPE 
PPLPIEPHETMY 


12582 


26483 


A 


12703 


272 


508 


TKLGKKQNRKTGNS KQQSAS P PPKERS S 
SPATEQSWMENDFDELREEGFRRSNYSE 
LREDIQTKGKEVENCEDNLEE 


12583 


26484 


A 


12704 


341 


1 


VPQHPHRPLAPPSLGPQSWPLMEGSRPR 
SSLSIiASSASTISSLSSLSPKKPTRAVN 
K I HAFGKRGNALRRD PNL P VH I RGWLH K 
QDSSGLRLWKRRWFVLSGHCLFYYKDSH 
V 


12584 


26485 


A 


12705 


187 


365 


WEPSCRGPSAFKPTRCQSYDWACMCGAE 
GRSAMEQPQEESPEVREEEEIEEMAHAE 
GTP 


12585 


26486 


A 


12707 


238 


596 


LWLVRVKYSIMSGAALGLE I VFVFFLAL 
FLLHRYGDFKKQHRLVI IGTLLAWYLCF 
LIVFILPLDVSTTIYNRCKHAAAJSTSSPP 
ENSNITGLYATANPVPSQHPCFKPWSYI 
PDGIMPI 


12586 


26487 


A 


12709 


131 


2 


DRVSLSPRLECSGTILAQPSRLKQSSHL 
S LTGS WD YTHAAP C I 


12587 


26488 


A 


12710 


151 


2 


YRQGLILLPRLECRVMIMALCNFESPGL 
S D P LT S AS RVAS TTAT CHHT C 


12588 


26489 


A 


12711 


151 


1 


NFPEFDHFTVGCLRAGSSLIHYWWECKL 
VQSLWKAVWRFIKDLKIDLPLY 


12589 


26490 


A 


12712 


28 


411 


RWPAR PAGE PRE PHVS WVMKLNP QQAP 
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E=Glutamic Acid, F=PhenyIaIanine, 
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K=L»ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LYGDCWTVLLAEEDKAEDDVWYLVFL 
GSTLRHCTSTRKVSSDTLETIAPGHDCC 
ETMKVQLCAFKEGLPVFDVTEKYFLLPR 
MYRYYRCLYTYPSAAV 


12590 


26491 


A 


12713 


351 


675 


ENPRHTFIYSLPLGLQKHQVLTVDIGFG 
GTAI MT VGKS S KMLQHID YRKRWI LQDG 
RIFIGTFKAFDKHMNLILCDCDEFRKIK 
PKNAKQPEREEKRVLGLVLLRGEN 


12591 


26492 


A 


12714 


326 


457 


ACSHHQRKSQAPPPAHLWPDCPHPGSLP 
LLNP S VCNPGRE VS P T 


12592 


26493 


A 


12715 


200 


1 


TITKVCVFTRTCGLKGSLMAKALFDQRK 
DRESETE CQ VWWLT P VI P ALS E AEVGGL 
LEPRSSRPGV 


12593 


26494 


A 


12716 


208 


405 


KKERENKKHTNKKGRKKM VH I C Q WHS S L 
HRKFQG I YRKTTGTG WAQWLTP VI PAL W 
EAEVGGPPEV 


12594 


26495 


A 


12717 


157 


3 


AKETPTHKGSCKKKFFFFFFETESRSVA 
QAGVQWRGL.GSLQAPPPGFTPCI 


12595 


26496 


A 


12718 


1 


472 


SPAILPRLAILPYLLFDWSGTGRADAHS 
LWYNFTI IHLPRHGQQWCEVQSQVDQKN 
FLSYDCGSDKVLSMGHLEEQLYATDAWG 
KQLEMLREVGQRLRLELADTELEDFTPS 
GPLTLQ VRMS CE CE ADG Y I RGS WQFS FD 
GRKFLLFDSNNRKWTW 


12596 


26497 


A 


12719 


537 


1 


LjHTMNGGNE s s GAD RAGG p VAT S VP I GW 
QRC VREGAVL Y ISPS GTE L S S LE QTRS Y 
LLSDGTCKCGLECPLNVPKVFNFDPLAP 
VTPGGAGVGPASEEDMTKLCNHRRKAVA 
MATLYRSMETTCSHSSPGEGASPQMFHT 
VSPGPPSARPPCRVPPTTPLNGGPGSLP 
PEPPSVSQACI 


12597 


26498 


A 


12720 


369 


3 


AAKI I PATRKKAS LELELPSSPDS TGGT 
PKATISDTNDALQKNSNPYITPNNRYGH 
QNGAS YAWHFE ARKS Q ILKCME CGS SHD 
TLQELTAHMMVTGHFIKVTNSAMKKGKP 
IVETPVTPV 


12598 


26499 


A 


12721 


230 


3 


KFF W VLAGLTGKNS DAS AS LNQVHIS P F 
LFSRHHPFSLGPLSSPVLLQGSKRRQLL 
ATLRALESASLSQHPHVPV 


12599 


26500 


A 


12722 


26 


110 


REQYAEGNMRGPAPGKKTSGLQQKNVEV 


12600 


26501 


A 


12723 


224 


1 


WQNQDQASDPPKYS FTSQCCLS FARLAR 
RYGDVFQ IRLGSCP I WLNGERAI LQAM 
VQQGSAFADRPAFAS FRV 


12601 


26502 


A 


12724 


120 


530 


KKVARGRS RS RE R S RRRRS R P KA I T \NR 
T*GTRCTPRRWRSTVL>GMRSCTRAR*QR 
RSGLSRGHTRSAGLCDHGSVRAGSGDGA 
DGTGGGDRRLGLGRD S LLLS S S QS AAF S 
SSASGSSSFFSATQPLRMLLEYFWL 


12602 


26503 


A 


12725 


434 


222 


KEEKSRGEVKEQD*EKDREEEVEKSRE/ 
RRRS RRRRRRS E V * YRKGKRRRE RED I L 
VAERSHRS FRNS I P S TL 


12603 


26504 


A 


12726 


268 


370 


SNILIHMESMFFFFGTDIPSV*KNLPA* 
KTPGPDGFIDKLYITFRGELTTFPHILL 
H*FTEGAVLFNSFSK\AASITLTPKPNY 
DIMRKENYSPISSYIWNQCSFFLEQIFP 
LSFKLNCPGLIIPH 


12604 


26505 


A 


12727 


103 


636 


VCFISMKQPHGSRHPPCS/PPRPGS*VC 
QQSRPLGPQPSAFGQLA*HRRGLLGLQG 
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M 
eth 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PRSPMQPVPHGGRVSGQTLPKGPTPKRP 
PGPAPRHGGHSWLLK*LVCAQPPPRP*S 
*ASQAVFTLQVPGKPQWTPCPVPVRAP 
TP\PLSNGGLGVSERH*DGH*QAPTSP* 
PRCQGGAGEGPQ 


12605 


26506 


A 


12728 


1193 


1638 


GLGFAMLPRLALNSWPQGDPSVPASRAA 
GTRGVHRHTQLQVS FNYYKVLAMHSGSQ 
L* SQHFRKPRRSDHLRSGVRDQSGQHGE 
T P S L / L KNTR I S WAW WHT P L VP ATRE AE 
ARELPE\PGRQKIASEPRVKPICTASLG 
NTCETPPQKK 


12606 


26507 


A 


12729 


304 


101 


RHLHWPCPSPLAPTPDISHCPEPPKSQT 
SP/CPGT*CLGFLKCPSSCHFA*K/PLH 
PSEPARKSLAPDTCP 


12607 


26508 


A 


12730 


91 


264 


S QRS I S GLRVKENL I F VM I MFP P I YS S Q 
TFWSQTFLMLKIVFMGISIS ICYIL/NT 
E KNLGQG WWLAP 1 1 PALWE AE AGGLL * L 
RVKENL I FVMIMFP P I YSSQTFWSQTFL 
MLKIVFMGISISICYILKLRKI 


12608 


26509 


A 


12731 


1026 


406 


LAHFRSQI FS FSHI LVHFERMVLNYRYY 
liACVRRWFRFLHVIjAYFGR I VLKYR Y F 
LVHFRREVFRFRHMLACFRRMVLYCYFL 
VLRKI Lli I YVT FLT YFRKVFLRHNFLRG 
GY*RKYFGYAPRWLSFFLYRCQCFLHYF 
LFYLWLRLHLQF/ CCFWSFCLFDFLFL 
FS AACVFS VLQ I AI VMFPFHALQLFFF * 
WLiCFSNSRFQ 


12609 


26510 


A 


12732 


1508 


157 


QD VGG S G F KVD TH P RGRMAS I FS KLLTG 
RNAS LLFATMGTS VLTTGYLLNRQKVCA 
EVREQPRLFPPSADYPDLRKHNNCMAEC 
LTPAIYAKLRNKVTPNGYTLDQCIQTGV 
DNPGHPF I KTVGMVAGDEES YE VFADLF 
DPVIKLRHNGY* PQG* LKHTTDLDASKI 
T \ QGQ FDEHYVLS S RVRTGRS I RGLSLP 
P ACTRAE RRE VENVAI TALE G F KGDL \ A 
GRY\YK\LSEMTEQDQQRLIDDHFLF\D 
KPVSPLLTCAGMARDWP\DARGI \WHNY 
DKT\FLIWINEED\HTRVISMEKGG\NM 
KRVILSR\FCRG\LKEVERLI\QERGWE 
F\MWK*AP*EYILT\CPSNLGT\GLRA\ 
GVHVRDPQSFSQDPTAFLKILEKPRTPR 
KRGHKWWDIAA\VADVYD\ ISNIDRIG 
RSEGEL\VQIVIDGVNYLVDCEKKLERG 
QDIK\VPPPIjPQFGKK 


12610 


26511 


A 


12733 


261 


487 


TGSETDCAKPQASFLPREVPTVAEMKME 
LLKNKQFW\RGWAHACNPSTLGGQGGW 
IT* GQE FETSLANMAKPHL Y 


12611 


26512 


A 


12734 


382 


668 


YKRITD I FVDSETVHILINKRQS CRI PG 
F I QLVQL I SHQLAAPRD YTVSHS VAQAG 
VQW/RNLGSLEPLPPGFKRFLCLS *HAI> 
KNLSS CDTPPQY 


12612 


26513 


A 


12735 


401 


27 


GDRAEESAEPRAWSHSDNSHRYTTLFIC 
LTHTHVHNP VHS \ HTHTHTHTHTHTHTH 
TVSHRHTETPPLLLKQTGLKFY*NSRDD 
TPRSRPGSSGLQRLSSSPPVPFQPGTVE 
ASADFCGHDLLTT 


12613 


26514 


A 


12736 


202 


182 


KYLP IFINLI TMNI L FF FETHS CSVGQA 
EVH*S*LKPMPPG\SSDSPASAFQVSGI 
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sequence 


SEQ ID 
NO: of 
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26 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
XMUnknown, *=Stop codon, /-possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














TGSPSSSWLIL*FFLKIGFTKKG 


12614 


26515 


A 


12738 


319 


2 


QNTP PHI YE YI PRVDAFLLFLEMS YS VY 
WAVLS PLYF WKLALTLHCRLALNS FL 
HKI QEPS \LGSGSG\PLSCNT I I *RLCL 
RSLHLVFRSLWILPICDSSSVF 


12615 


26516 


A 


12739 


1363 


2000 


DRVFVIPGWKCSGTIKVHCSLDLPGSSN 
PSTSAP*VAC\ATGTYPPCPGLNFLYF* 
ME ERGFTQVA\ QAGSQNS WGS kpngts e 
AS PKVLGMNKP *NLPNPGRS ILN*DGS I 
RYKDNYPA/ WE P P KRS YKCLRQKKS I * L 
SAGPRDYEASQPRKNYP I S SLLTFCTS S 
LHFNPTELTCSSQLQKSIRRKLKSEESC 
PPLPACS LKHTQAI I KI T F * RIiHRTA/ L 
FYLLK*NCFRN* S * * KYFLLS VSNLKI / 
Q W VRS S LH P GAPG Y TP T V * RP KHKSNVL 
I * I QS / I TRCLF YI LVYMRT I FPLQLFL 
SKHPSTYKEDFA*VLPLSFFFFEMESRS 
AARRSLSSLQPLPPRLKRFSRLSLLSSW 
DYRCAPPGLASPSILSS*GSCCSIQFIL 
RMST I SHAINVLVLKNT YLVLVSASEHS 
LIKKPC 


12616 


26517 


A 


12740 


1696 


743 


GG Q I MRS GVHDQ PD QHGE TL S IjIjKI Q KS 
AGPGGMHL*SQLLRRLRQENRL\NRGCS 
EPR/SRHCTPAWATEQDSVSKKIKK*KK 
*NHLESKQQGQPALEPPEQAGGQLRKTE 
QLQDGRRELAADMTL* PGHSKRLGALPR 
PLLSAYYFNKVHAARRQAYLETPGTFTS 
YQ RE EGE QAL I LGVE AQAS S P T VFHRE R 
RQSQALCSHRKSQEAPVRPAHPPRRVPG 
LGKPSSQGLSAHLGQDRAPAPRRASWDQ 
RSQAPISVTFPSVLDKEESVPCGPGFPG 
HAPAPRGIHGATWEGAHSRGYPGHFLAL 
PQHNS DEQRPN 


12617 


26518 


A 


12742 


2473 


445 


RGARRSRRRSRHRRRRHQSRPVRAAPRQ 
PEQGRRRGAPTHGPQLIMMDLELPPPGL 
PSQQDMDLIDILWRQDIDLGVSREVFDF 
SQRRKEYELEKQKKLEKERQEQLQKEQE 
KAFFAQLQLDEETGEFLPIQPAQHIQSE 
TSGSANYSQVAHI PKSDALYFDDCMQLL 
AQTFPF\VDDNEVSSATFQSLVPWYSPG 
HIESPVFIA\TNQA\QSPETSVAQVAPV 
DLDGMQQDIEQWEELLSIPELQCLNIE 
\NDKLVETTMVPSPEAKLTEVDNYHF\Y 
SSI PSMEKEVGNCS \ PHFLNAFEDSFS K 
HPLHKNDPNQLTVNSLNSRM/PTVNTDF 
G* WNF I LVF \MAE P S I RQQAWP S P ATLS 
HSLS\ELLNGAHGCFLDLFTL/CKAFNQ 
NHPEGTA\EFHGF\DSGISLNTSP\SVA 
SP\EHS\VESSSYGDTLLGLSDSEVEEL 
DSAPG\SVKQNGS*NTMYSSSGDM\VQP 
LSPSQGAEHFTCMDAQCEEHTRGKDLPV 
\SPG\HRKNPISQKDKHSSPLGGLISQR 
DEL\RAKAL\HI P I PCRKKSFNLPVG\D 
FNEMMS KEQFNEAQL \ ALI R \ D I RREGV 
RNKS/AAASGICRKKENWENIVELEQDL 
DHLKDEK\EKLLK\EKGENDKSLH\LLK 
KQLS TL YL \ EVFQHAYRDEDGKP YS PS E 
YSLQQTRDGNVFLVPKSKKPDVKKN 
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SEQ ID 
NO: of 
nucleotide 
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NO: of 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, --Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


12618 


26519 


A 


12743 


6 


457 


RPRNRPG I RVD PRVRGRVGHAPHEGLVP 
ERMYRGSPTACETQAAA*ERAFGPSSPT 
C\RLP I PRMSTS VPQG&TWTQRVKKDDE 
EEDPLDQLISRSGCAASHFAVQECMAQH 
QDWRQCQPQVQAFKDCMSEQQARRQEEL 
QRRQEQAGAHH 


12619 


26520 


A 


12745 


297 


12 


Q ER P VRML VLHFGRLRIADHL KLG VRDQ 
PGQ YGE T P S LLK I P KLSGM \ MWAP * I C 


12620 


26521 


A 


12746 


98 


428 


YNTSFNETVLLLTTPITIDCLYTRKDAI 
APESACGVCPCSLLGVGA* IPKWIRLD 
LSKKHVTTAYGGFMCAKCVLDRINRAFL 
IDEH\KIVPKV*KALSQSSKAFCEHET 


12621 


26522 


A 


12747 


3 


638 


LWLWS LCWVWAVS LPLQF I LGS LHPCQG 
QASWREVDLLHEGSQEASSKPSSGSVPV 
. GLLLDQEMVTPLLVCICGI LLHQEMVS P 
LLVYTRG I LLHQEMVTPHPGL YPWDS P P 
LDGHSPPGLF\RGAAPPS*GG/PLPVLV 
YTRGAAPPSGDS\PPPPGLYPWDT\PPS 
GDRHS P PGLYPWDTP PSGDGHS P PAL YP 
WDSPPSGDSHSPPGLFPRG 


12622 


26523 


A 


12748 


2 


333 


DMVLLCHPGWSAGS I *KKTKERERMMNL 
HR**IDRERERERERQIQ*KE*KRINEF 
EGSKERFTQSVKQRMK/ELEKIRTLDTS 
ETIYLSIYRN*SLRRGQERRTERERMRH 


12623 


26524 


A 


12749 


30 


333 


KTS YLLP VQWWAQNDNERYSS S KNTIMA 
LPLPLPVFPRSPSDAERKLDCSAAISAH 

CNLPA\DS pasacrvpaiagarrha* lv 

FGFFWWRRRFAVMAGLVS 


12624 


26525 


A 


12750 


231 


39 


INDLL / CLF* FKKL I WGRVW * LMP 1 1 PA 
RWE AKAGGS F E P R S L KLQ * AM I TPMYS S 
MGGREQDPVS 


12625 


26526 


A 


12751 


49 


273 


HLQVTEVFWFWCVFFRRWGGSHCV/AQ 
AGV*WLFTGTWPCCSPELLGSRDPPAS 
AS * VAGTTGACLAANCRGF 


12626 


26527 


A 


12752 


505 


897 


SYLRVQFYSAFSLFFRHEVLPCHTGWNA 
W* S QLTTASN/ CLGPSNLLKAPHLAN* 
KKIF*RQGLSMLPRLVLNSWPHMILLP* 
LPE*LGLQARATAPGSGFTFCLTQDSIL 
MCSPTVHKLSDS IALETKQ 


12627 


26528 


A 


12753 


335 


542 


CCNEFFLSQVWWLMPWPATR/SAEAGG 
LPGPGS QRLR* ARS KP VNSHCSAGGRCG 
IDP I S I KQQKNNNR 


12628 


26529 


A 


12754 


356 


72 


WHEYYLLMEHTKKCHLS*GLYNGLNN*Q 
/WCTHPWPATWEAKAGESLEPRSSRL* 
C TM I T P ANS HC P PAWATARS C LNQ S I NQ 
SSNNWQGMITGGK 


12629 


26530 


A 


12755 


340 


127 


NYLF I YFRDSLTML PRDECSN* FTGS 1 1 
VHITLKLLGSSLP/ASASQVAGTTGTRH 
HVQLSNYFKIKIKPKP 


12630 


26531 


A 


12756 


290 


511 


KKNQPGT/ CGSRL* S QRFGRLRQADHL \ 
DQLGQHGKAPSL/LKNTKISWALWRTPV 
FPSSQEAEMEELIEPTSSRLQ 


12631 


26532 


A 


12757 


254 


549 


YP GAKKQR P GGD S VRGTHLQ S RCW C VLL 
HNCQ * P KSNTQT F P * P KKTLRE * NAT * KS 
TDL/WPGAAAHLCNPSTLEG*GEWIT*G 
QE FETSLGNI PRPQ I Y 


12632 


26533 


A 


12758 


617 


451 


NKKRRGLTLLSRLKCNGVIIAHCSLKYP 
GS S E P PTS AF* VAGI T Y\GT * HHAWAY 
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sequence 


M 

eth 
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26 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


12633 


26534 


A 


12759 


91 


393 


KWYTKECLYLLVS I FSRYLLS I YYREVL 
AQTLGKLS KTVMCPALEETVAWGLQI K 
SKK* /WLGMVTHACDPSTLGG*GGWIAW 
GREFGTSLASMAKPCLC 


12634 


26535 


A 


12760 


10 


221 


QSHNTNGCLDLHFQKGPWQSVGETLNL* 
TIFLCVGVCVCVCVRHVYHM/CYIYSDF 
ICPSIECEYFSFAHR 


12635 


26536 


A 


12761 


309 


55 


I IRYKNYI S IKLEGWKRLYHATTHEKKV 
GVAVL I S E KVDFRAKN/MYQDKEGRFLL 
*R 


12636 


26537 


A 


12762 


640 


896 


TIVLKCNFVLFCFRDEVPLCCPGWSAVA 
/QSQTQLW*TTQPGLKQSSCLSLLSSWN 
* RHVP PHLAMFLKTTFTLL YTCGHLETN 
IG 


12637 


26538 


A 


12763 


1 


357 


IVPLHSSLGDRVKPHLLKKKKTCSTSLV 
I RETQ IKITRYTYSNG* S * RKNKAGNNK 
C WHGYTATGTL I AD / CKSKQMLWKTF * Q 
FHIKLNIHY*YNSEIQFLGLYPRDKNIA 
RCGGLRL 


12638 


26539 


A 


12764 


129 


320 


RWKCVKLKNTGQAWWLIPVI PALWKNQ \ 
AEGSLEARSLRLQ*TLIAPLGYSLPG*W 
SGTLSLNK 


12639 


26540 


A 


12765 


617 


435 


GVKGLILPKLQKLLKSIE/I/E*KLPKS 
FYDASLTMIPKLHKD\RPISHVKIEARI 
LHKISINII 


12640 


26541 


A 


12766 


243 


5 


Q KI KKR I KKNKQNLi * E I WD YVK* PNL * I 
IGIPERGEKVYNTENIFEGIIQ/ESVPN 
L / AYLVRE VDNQ IQEI QRTPVRYYTK 


12641 


26542 


A 


12767 


101 


16 


RTQLFVS VFWFCFLFCFETGSCS ITQVG 
VQWCHLRSLQLPPPP\GSRDSPPPSSQN 
RG*TNI 


12642 


26543 


A 


12768 


362 


92 


RVPHMDLGEECTS PYMYI * PMT CVFKHG 
* DGNFC VT YFAT III YTY I / YLSVCVS I 
*I*CYLCVCVYIYIYIYIYIYIYFIYKY 
MKERRL 


12643 


26544 


A 


12769 


1161 


824 


QAKDLS THFTKENM * MG/ NKQMKRLSE S 
LVI S EMQ I KTTMRS HLTHTR I T 1 1 KMTS 
DKICP*RCG*iy!RA/LLQG*WKYKIVSPIi 
W KI VW P FL KT \ INMN I S CNLM I L FLGF Y 
PIY 


12644 


26545 


A 


12770 


241 


237 


K*RIWA\GVMAHACKPSTLAGCDGWIA* 
SQEFETNLSNTGRPCLYER 


12645 


26546 


A 


12771 


326 


6 


DTVSRKNKSGKIFQLSSRV* I YERSQSG 
VKVYKCKTFGKAFTQ / HF * AHMRM YTGE 
KPYKY*ECGKFFILVLLLLLMIQKYFHL 
IKIVRLYLIRKKVSCKQPSNKILQS 


12646 


26547 


A 


12772 


413 


1 


KKTFYFFFFETTQAGV* *CDHNSPWP*P 
P \ GS DNP LT S AS * MA \ GLQS CMHHHAQL 
S * FVFLEAGSHYVAQTDGDSPTSASQSA 
GITAVSHHTRPILYILIVLSYAISFFSI 
LYS AYFLQ PHLP IHNNFLFS Y I S S S C 


12647 


26548 


A 


12773 


2 


1055 


FFFFLRLNLTLLPRLE/CVILAHCNLHF 
PG * SNS PAS ASRVAE I TGSRHQAR 1 1 FV 
FLVEMGFHHVSQAGLE/SPDLK*SAHLG 
L P KG * D YRRD I T PGHH * S F * * R YFLKT Y 
ISRVL/WCSLLGTFMFMIAFDHM*TW*H 
*QITHFHFLLQIWKLISERLRSAQGQQAL 
KTVTSTFSSKYVCRAFLAEPGWIAIIjYV 
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peptide 
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Predict- 
ed end 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=XJnknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














TF*SLDSRRTCLWAVSNFFYSTLFLKL 
RQQVLLTG / P YHLFS ARPS S CGSDKKAR 
ELNLSLSL\CFFFFGKRVSGTQAGVQ*H 
DHSSLQLWPSRLKQCSHLSLIiSSWDYRS 
ITPHLANF/ CNFLSGWSRTPGLK 


12648 


26549 


A 


12774 


315 


98 


FRCFLNSINPLESPHVRKTCAGRAQWLA 
PVIPPFWEAEA\SQMPQVIHSR*LEARS 
SRPAWPTWQKPCLKYKN 


12649 


26550 


A 


12775 


275 


59 


NPSPYKIQKI S WAWWYMP WLATR \ RLQ 
*AEIAPLHSSVGDRARPCLPKKQKTNKQ 
TKTSGEPVLVQGPATVC 


12650 


26551 


A 


12776 


73 


381 


SLLNSELSSIPRPTWESPSTSDSAWPMQ 
NLASEPLPLP*TTPPPDFPALSA/PE/P 
LPFTPSPRDTLPPPLSPSYKQAPVLSVH 
PPVMPLKYVPS S PLLLS FLYL 


12651 


26552 


A 


12777 


274 


342 


LNCVIA*WLMPI ISALWEAEVGVSFA/ P 
/RSLRPAWATW 


12652 


26553 


A 


12778 


2 


478 


TIYYTKYTTFRVPNLGLQLLTAFFCVCV 
GVSLNGFKNAKDYGGSHLKYSVYITGFL 
LQ FS LKFDCFLC VC VC VFP * MA* KMP K I 
M/RGSHLKYSVYMT*FLF* FSLKLKYFC 
DPILRKRI/WPGAMAYTCNPSTLGG*GG 
QIT*AQEFETSIjAKWKPCLY 


12653 


26554 


A 


12779 


680 


453 


SETDWRKNKFFPSNFPSNLRTNSWTNSC 
SKKTFKKH/RVGLGVLAHT CNP S TLGGR 
GGWS P * GQEFENS LTNMVNHFS 


12654 


26555 


A 


12780 


59 


487 


SLHKHPERGLQS FQGCWTQEMLGGSHVQ 
QRAWEL CAP PNLT C P G YL S F FLRQDQ AL 
LSKLECSGTIPAHRNPSLPSPQLKVSSH 
LSFPSSWNSRCTTPHQLL,KLFF*DRIRL 
FCPSWSAVAQSQPTVT/SSLPSPQLKVS 
SHLSFPSSWNSRCTTPHQLLLLFFNFL* 
RQVFTMLPRLVSNS * VQAILPPWPPKVL 
RLQA 


12655 


26556 


A 


12781 


109 


361 


LSPWHHSRNYTLIYPYTLNFWMSAFPTP 
KEEFFFSFPETRPHSVAQAGAHW\AI IA 
HCGLDLLGS \SDPPTSAS * AAGTTAI I P 
G 


12656 


26557 


A 


12782 


3 


204 


LIDGSLALSC*LFCGGAIIAHCSLELLG 
LGQ\S PVSATRVAETTGVCHHAQLAFVN 
LHLRTRS KHCGH 


12657 


26558 


A 


12783 


2357 


6366 


LTGS \NSHI T I LTLNI /NMGLNAP I * RH 
RL\ANWIKSQDPSVCCIQETHLT\CRDT 
HRLKIKGWRKI YPS PMGKQKKKAGVAI \ 
LVSDKTDFNPTKIKRDKEGHYIMVKGSI 
QQEELTI LNI YAP \NTGAPRF I KQVLSD 
LQRDLDSHTLI \MGD\FNTPLSTLDRST 
RQKVNICDTQELNS ALHQADL \ I D I YRTL 
HP\KSTEYTFF\SAPHHTYSQNWTTIVG 
S KALL S KCKRTE 1 1 T \NYLSD\HSA\ I K 
LKLR I KNLTQNHS TT\ WKLNNLLLND Y\ 
WVHNEMKAE IKMFFETNE /NKKTPTYQN 
FWDAFKAVCRGKFI \ALNAHKRKQE\RS 
KIDTLTSQLKKTREAKSKQHS KASRRQE 
ITKIRA\ELKEIETQKKTLQP\LKKISE 
SRSWFF\ERINKIS\RPLARL1KKKREK 
NQIDT I KNDKGD I \ TTDLTE I QTT I RE Y 
YKHLYA\NKLENLEGMDKFLDTYVSLPR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
. X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LNQEE VESLNRP \ ITG\ SAI VAI INS \L 
PTKKSP\GPDGF\TAEF\YQRYKEELVP 
\ FLL KL FQL I E KEG I LPNS F YE AS 1 1 L I 
PKPGRDTIKKENFRPISLMNIDAKILNK 
ILAKRIQQHIKKLIHHDQVGFIPG\MQG 
WFNIRKSINVIQHINRAKDKNHMIISID 
AEKGFDKIQQPFMLKTTNKIGIDGT/YY 
FKI IRRAIYDKPTANIILNGQKLiEAFPLi 
KTGTRTGMPSLTSPLLFNIVFCPIVFWA 
R\AIRQEKEIKGIQLGKEEVKLSLFADD 
MIVDLENP I VSAQNLLKL I SNFSKVSGY 
KINAQKSQAFLYTNNRQT/EEAKS*LMS 
ELMSELPFTIASKRIKYLGIQLTRDVKD 
L F KEN YKL \ P L I KE I KE D \ TNKWKN I F I 
PCLWVGR I S \ IMKMAI L \ P KVI YRFNAI 
PIKLPMTFFTELEK\TTLKFIWNQKRS\ 
RIAKS ILSQKNRAGGITLSDFKLYYKAT 
VTKTAWHW YQNS MVL VP KQR Y I D QWNRT 
EPSEIIPHIYTILI\F\DKPLLEKNKQW 
GK\DSL\FIKW\CW\ENWLA\I CRKLNL 
\DPF\LTPYTKINSRWIKKKDLNVRPKT 
I KTLE ENLG ITIQDI GVGKDFMS KTPKA 
\iyiATKAKIDKWDL I KLKS FC/TI CTAKE 
TTIRVNRQPTKWEKIFATYSSDKGLISR 
I YNE LKQI YKKKKKTTPS KKWTK\ DMNR 
HFSKE\DIYAA\KKHMKK\CSS\SLAIR 
EMQ \ I KTT \MRYHLT P VRMAI I KK\ S GN 
NRCWRGCGE IGTLLHCWWDCKLVQPLWK 
S LWRFLRDLELE I P VDPVI PLLGIYPED 
YES CCYKDTCTRMF IAALFT I AKTWNQP 
KCP TM I D W I KKMWH I YTME Y YAA I KND E 
FMSFVGTWMKLETI ILSKLSQEQKTKPR 
IFSLIGGN 


12658 


26559 


A 


12784 


787 


926 


PQAIRRPRPPKALG*HNSVDLGWAWWFT 
PAI PTLWDYTHEPLYLAKI S * CFKKVYK 
FVLNCIQNCPGPHAARHS/SGKTS*V*S 
N YL F F S F C FE T KF CS VTQARLQ WHDLNS 
LQPPPPGFKRFSCLSLPGGWDYRRLHTR 
SANFCIFSRNRVSPSWPGWSPTPDLRRS 
AVLGLPKRWDYRREPPCPAKIYTIMAPQ 
KVNSHSSQPVSFLS 


12659 


26560 


A 


12785 


248 


285 


C*VIF*CVMILYVLIYVEKITSIRLAVY 
W/CIITRLFVK*ICSCLLGC*FLFFNV 
C*LYI FI IFVFFFFFFFFFFFFFFFELN 
ILY 


12660 


26561 


A 


12786 


243 


37 


RRS AGHGGS CL * S Q YFGRPTRADHKVRS 
N / RGE TLS LVK I Q KVS QAW W * E P WPAT 
RQAEAGADAWVDR 


12661 


26562 I 


A 


12787 


75 


289 


DYRHEPRFFFLYTFFEMDSHSVARAGEH 
WCDLGSL*PST\*GSSDSSASTSR\QRH 
HAWL I FGFL * RRGFAC 


12662 


26563 


A 


12788 


83 


256 


RKMYVVGLFVTAQN*K* PKYPSTEE/YI 
K/RLWYLHTMEYYSAIKIEKLGTRAQNR 
LILY 


12663 


26564 


A 


12789 


197 


499 


QSLKSVEETVFRNNKQ/ SIPTFQVTLMP 
WWLIPII*ATWEG*GRRSPLNPGV*DQP 
GQHSKTPSLQKKTKFS *AWWRVPWPA\ 
IGEAEVSGI PLSPERSRL 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ n> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A Ian ine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V==Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12664 


26565 


A 


12790 


394 


91 


F YNS TKFKWKNI QFL I TRCRGDQ I QVI R 
AHAVQ LGFHVL YF / C L S QNL S I YL Y I S I 
YVYTYRYIHTHMCIHII*YLYIYTYIVI 
KH* KEHWW I KHCFPTERK 


12665 


26566 


A 


12791 


735 


75 


FFFRQEGFLSPPKRGGVPPPHTKKYFSP 
RGSFLWGWGTSRPPPRKCFSPPPPPVFL 
PPPKKKKIISFSPTKLAPPPEYFKSPPP 
PS\PPPPSSPSPSTFFFFKNFPFSPSSF 
SSPPFFFPSPAWEDPPVQSRRIY*FLPP 
PF* I PE I PR\RVLKKKPGEGRS WGLFPP 
PKGPKRRVPLGPEFKTRLGHKTKPRFFK 
KKKKKKRFL I S CLWWHMPAI PSTLGG 


12666 


26561 


A 


12792 


482 


342 


MGVFVFFFETGSHSSVTWAGVQ*CNLSL 
LQP/PASSDPPTSDSRVAGST 


12667 


26568 


A 


12793 


405 


169 


FYKKKKTGSGLALS PRLECSGTI IAHCN 
LELLGSRSLRL\SNTPTLAS *AARTVSV 
CHQTHL IF* FFVETGS CYVASA 


12668 


26569 


A 


12794 


270 


94 


YVCYSYKQIKICSHVKRRE*NTFKIYLL 
KYIALGLTGHFFPL* TFSKTVTLWPGAV 
AHACNLS PLEGQGRWI I /R/GQE IKTIL 
ANM* IPVFLHVNI FLS VCSYS IHTTLDY 
LFYYSCYILISL 


12669 


26570 


A 


12795 


284 


415 


NF I K I I KKKVMGLF YGKTFLLNLKKKKR 
KKK*WPGTVPHA*NPST\LGGLGGWITR 
GQEFETS TLATN*NPTKS ISRGF 


12670 


26571 


A 


12796 


1387 


32 


AP SS FAIRS FFSGPMNAFFS SMVWKRPW 
PNLEWSMNLRS I FS RARRFVCTS KDLR 
RVSTRFLVHLTAFLGYKAGMTHIVREVD 
RPGSKVNKKEWEAVTIVETPPMVWGI 
VGYVETPRGLRTFKTVFAEHISDECKRR 
F YKNWHKS KKKAFTKYCKKWQDEDGKKQ 
LEKDFSSMKKYCQVIRVIAHTQMRLIjPL 
RQ KKAHLMGDQ VE RGAP VPE KAD \ WAP R 
EALS S KVLVTQVFWAG I K* SNF I GGDPR 
AKGYKGGQPVCWAHPRKLPPQRPHPRAL 
RKGGPVLGAW \ HP \ARVA\ FS VARRWGR 
KGLPFHRTEINKKIYKIGQGYLIKDGKL 
IKNNASTDYDLSDKS INPLGGFVHYGEV 
TNDFVMLKGCWGTKKRVLTLRKSLLVQ 
TKRRALEKIDLKFIDTTSKFGHGRFQTM 
EEKKAFMGPLKKDRIAKEEGA 


12671 


26572 


A 


12797 


86 


364 


EQDNRIFFSFLSLFFFFGETEFGPFAQG 
GGQGAILGPLKPPPRGL\SSFPT*GSQE 
VGTTGAPHD I F CF FNKKGET PRLYKKNK 
NNRGGGATP 


12672 


26573 


A 


12798 


101 


440 


HC S R YATG I FE W YS G I LVL I L LRFG I YE 
PMNLNIGDPYDSPPSRLYKMRLWERMAK 
SLIEVSLKNSHFWLGMLAHACNPRTLRG 
IGGRIA*TWGF\KTSLGNIARPHLYLKK 
I 


12673 


26574 


A 


12799 


364 


195 


NPPGDLGEETLLWGGEIMGTTPP*FPPP 
KEGFLPQDPPGGFNSPP I /QGRS FS FPP 
PGKFGPPQGFFKRPPPFFFFLNNNQ 


12674 


26575 


A 


12800 


86 


489 


PTAMAEEG I AAGGVMD INAALQELLKTA 
L I HD GLAHG I RKATKALDKRQAHL C VIA 
S NGD ET VYVKL VE AL CAEHQ INL I KVDD 
/NKKLGE WVGLCQIDREGKPRKWGCS C 
IWKDYGKESQAKHVIRENF*CKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


12675 


26576 


A 


12801 


64 


211 


gnhkksnyslapww yttwpat * eae ag 
gsle\l*camitpvnshssla 


12676 


26577 


A 


12802 


300 


3 


NNVTTGSNIQFHSFPLIRKDICVSYIKN 
DAWNY I KNSNPNGE TQVYQ *YSFSN*NL 
KIWLGKVA\NPSTLGGHGGRIA*A*KFK 
TSMGNIVTPCLINKQK 


22677 


26578 


A 


12803 


208 


394 


SQHFERPRREDCIiS\QEGQYSEIPSVQK 
NVKI S WAW * CMPAI PATREAEAGGPLE P 
RRSRLE 


12678 


26579 


A 


12804 


183 


348 


AF I LGKGK YE P YN P C LGWA * W CM PWLA 
SWEAEAGG\SLEPRSSEL*CAMLIGCLH 


12679 


26580 


A 


12805 


397 


238 


ETEFPFCCPGWKWGDLGSQQPLPPGFKR 
FSCFSLPSK\WGQ*SLAPVGKGCK 


12680 


26581 


A 


12806 


1 


293 


PTRPRTS YEKQGYLLLPPVFS IVLEVLA 
RAI S QETE I KS VQMGKEE VKL S L \ FVCV 
YIYA*ENPVESTKTVDANLFDTRDWFHA 
I FPQTVGE VGMVLG 


12681 


26582 


A 


12807 


297 


281 


aknprrqpreipggvfptpgfppyfkak 
mpegpfpggfffkggvlwgtpppfffff 
ffsetesrfvtrXlqcsgaisahcklrl 
pgschspetc**r*g 


12682 


26583 


A 


12808 


255 


59 


AGCSGSQLFGRPRGVDHLSPGV*DQSVQ 
HGEILSTKNTKISWWWSVPWG\EVGR 
SLEPGR*RLE 


12683 


26584 


A 


12809 


247 


310 


PliHSIIiGTPTTPAPTPGKPLLIRTSILY 
* KP I L S QAWWCAP LVPATQ / E AE ARG S L 
EPRDSRRS*AMIKPVNSHCTPSWALPRP 
LPQPLESHC 


12684 


26585 


A 


12810 


101 


477 


IGKEE I KLLLFPDNLME YTVNF * MYKPL 
ELiIS*FSKVTEYQVNTEK/SNCIYT*QL 
Q I ENE I AKT I * FM IAS KS I KYLK I SLTK 
CN\ KWRD ILCICI GRLS I I KVL VLPKLM 
CGGEKI FNPNPTGFW 


12685 


26586 


A 


12811 


1022 


696 


C F FVL F F KEMGS H Y VAQAG VRW CDHGS L 
YPPTP\GSSDPPTSAS*VAGTIGAHHHT 
WLIFKFFVETRSCFLIQAGLKLIASSYP 
S Q PRT S P KCLG F TD VKS LHLAWIi C P 


12686 


26587 


A 


12812 


166 


143 


RKEERFQ INNLGFH VEKLE KEE Q I \NPK 
TNRKKKIIKTRV* INEIEYRKTIGK\lN 
KNKSWFF*KINKIDKPLAMLDSGRRDSN 
Y*NITKSIEEGRRRKEEAGKRKEEGRKR 
RRRWRQRWR 


12687 


26588 


A 


12813 


163 


423 


KAFWEF INHKKTGP F * RGP IGKVS P PGG 
NLASSI*KG/DPFPSPLiKGPQGSSGGPP 
PKKP PTGFNLAFGGLRGRETKRGP FFLA 
GLLP 


12688 


26589 


A- 


12814 


2521 


2842 

- 


INTMYFPP*EMLVGWAWWF/RASNPQHF 
GRLRQADRLRSGVRDQSGQHGETPRLLK 
NTKISWAWWRAPVIPATWEAEAGESLEP 
GREGCSEPRSCHRTPVWVTRMRLY 


12689 


26590 


A 


12815 


173 


395 


LDQPGQQRETLSLLKKKQTNKQTKKKNG 
FKKFL*LDTWVHTY\NPTPLGGRGGGFF 
* GQDFETTLANMGKRGFY 


12690 


26591 


A 


12816 


43 


346 


CVYVS /GVCVCVCVCVCLAS VCMYVCLH 
L*VLFGIFYV*DHNICKLRSFCFLFKNT 
FSHVYF*KIIWPGVEL*VTVFLFLSSLL 
HTSCVYEKSYAILIFSAL 


12691 


26592 


A 


12817 


647 


931 


SQHFGRPRRADHLRS GVQDQPGHHGETP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginme, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PLLKI \QKLAGRGGA*L* SQLLGRLKQE 
NRLNPGIRSCSEPGSHHCTPAWATE*DS 
VXjKKKI gad ky 


12692 


26593 


A 


12819 


392 


2 


PLFFFFFFFLETESHSVTQAGVHNPPGP 
S KP PTSAP * AAGTTGMH YS AHAQ/L I FF 
L*TELPSCRPGCSAMAVHRHDPFTDQHG 
SASFLIWASSPLLRQTGDSSLIiGGHHVD 
AKFSVDTQIxALCTKAQNS 


12693 


26594 


A 


12820 


1183 


870 


DRVSLLTPRLGVAVARNLAQWKPLPPGF 
KRFSCLSLPSSWDYKHTPPTPG*FH/RG 
* FF VFL VE TG FRHAS Q \ AGLNS LPQSDP 
PTLA/ F P KCW \ D YR \ HDHLAW PRKM 


12694 


26595 


A 


12821 


172 


298 


I YLS I YLPI / YLC IHLS IHPS I YPYLYL 
SINPSYWSFSREL*LT*L*YLSIYLPI 
IYVSIYLSIHPSIHISIYPLIHPIGFLF 
LENFD 


12695 


26596 


A 


12822 


51 


254 


YDFLKKHLLSSYRAKDLESLFLLHLHIF 
SGN* S VS YI CVCVCVCVC 1 CVCVYI C I L 
I KS WCS * S /CGVL * AI CAGVGWC*LI FG 
VIi 


12696 


26597 


A 


12823 


199 


668 


QPLPAPSPSAQKGRAHPTPSPARPQAE/ 
S LP S Q SLGTRTRP PLGRP P * S P P WGQVD 
LiWGWLGPSPTLPAPESLSPASLSTEWL 
CGTRYLVWGRSGAGQA 


12697 


26598 


A 


12824 


33 


442 


GRGKLLKKSQKQRKNALLKYDS KTRVGT 
LFHWTS VPVTRPLLL* INIRLIKLDVI K 
CSIHKIFHSEFLFFKQHSGL/CI*LSK* 
LCS * KIHQS GLGT VAHVCNPSALGGQDR 
RIT*GQEFETN*WMVGPCLYPPLPP 


12698 


26599 


A 


12826 


363 


75 


HWEPRPAGPTLPSAPGSPSGRPCPPHTT 
SG/PPPPNTHTPMSRSPWSWEGSDQRPQ 
PPHTDTGSPPN*GSRLRHERPPWGRERQ 
RPPPTTLVRPLHSH 


12699 


26600 


A 


12827 


76 


486 


CTVNL F I Y FEME S CS WQTGVQWCDLS S 
LQPLIWFSCVPTQISS*IPMCCGRDLVE 
GP*MMGAINSFLCCSYDSK* VS / ARSDL 
FFFFFFFF 


12700 


26601 


A 


12828 


345 


672 


KMKCTADLSLLEKDT*IKLEENR*QERI 
MIjRAE VNE I ENK I EN I NKTGGS S S E KY Y- 
KIDNPLTRSIRRKVN\KITSIRNEKGSV 
TTNPTE I KRTVKE Y IMKNFMP INLTT 


12701 


26602 


A 


12829 


208 


1 


PSNSTP I FKRNENI CPHKDPCTLMLMAA 
L F VM S KK * KQL KCS SNDE \ KLW Y I HMME 
* YI AI KRNKLL I 


12702 


26603 


A 


12830 


649 


172 


FPLKYSLSTTSVMAPSFTSCGGSQQVLL 
APLSRLLAWPNV/HPPPQPRHPAPHSFL 
S P PRS P PNYSP * S S QGPLLQPQS KHVRP 
L P QANARL P S AKNP ARGP PGAAQAP RDR 
PPA/VCPPLPPHSPLRHERAPRLFLPLP 
ATEPAALHAVHPGRRQERVT 


12703 


26604 


A 


12831 


1068 


667 


KLSWFFCSRLRLHFGS I FKVPSL/ PVPK 
LLLP I PS LCPSLPLQLQPAQPVP FNTAT 
WliCFC* / P * RV* PAPGYQRVGHFNFFPL 
QLNFIPIILQ*HLGF*KSASLSSSSSPK 

kvslchpgcsavwsahcnlkl:lis*sa 
clglpkc*dyrlqsprpp/ptifkvtkm 
cllkqltgsvsf 


12704 


26605 


A 


12832 


268 


1 


KCFFLLALNWPEFILDYFILNW*NSLRT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E— Glutamic Acid, F=PhenyIalanine ? 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














VTEKFLFMTLFLTY*NMNRPGMVAHACN 
PS / TLGGSGGRIT*AQES * TSLGNI VI P 
CLYQKL 


12705 


26606 


A 


12833 


654 


321 


LVGIRFLKVKYFNIKFPHLLAIISPSLE 
VFNLQHFYIYFLLFIYF*ETRSCSVA\R 
LECSGVIIVHCSIiKLLGSSNPPALASQV 
AGT I TAHHCAQQHFCI LEKSNL CT YF PL 


12706 


26607 


A 


12834 


867 


201 


RQMSPTLRTKSRDVASRSSFSAWAWGKL 
FTFWDFSMSLVSLQGPGGPFHSPQP*TQ 
GR*VENNQEPLATPFPGPERSLPGQPRIj 
EPAHSGKRRGLPLEIRPPVNVSDIHGDP 
SGAQWIuVRG VEAG P WAVGGQAP * SEGPS 
PSPPLDVSPDPGSSLPRSSSP*MDLRA* 
LSLTF*TFQKGSGN/DPASSPA\GKPG* 
EWRKI KGPPQGLLLGPEMWGSQARVLS 


12707 


26608 


A 


12835 


328 


1365 


YAFCLMMLDKKQI \*AVFLFEFKMVHKA 
AKTTQNI SNTFGPGTANKHTVQWWLKKF 
CKEESLEDEERDMGHGKVDNDPI*EP\Y 
*TT*KITEELSVDYSTVIRVVQHSKQIG 
KVKKLDKWVPHE LSGNQNYRFE V* SSFM 
LRNNNPFLNRIVTCNEK* ILYN/RPAQW 
LGP RGQ L QS Q F P KPNLAP KKKRS L VHW / 
SPVGL!LAPILDPTYSFLNP/GGETITSE 
KL C S G KLGKMHG KLQ YLL P ALVNRKGP I 
\LLHDNT/RDCVFAQPVLQKLKELG\YK 
VLPHP PYSPDLS PSDYHFC * HKDNFLQG 
KH/ SQHGAENAFQEFVKS * STDFYATGI 
NK/LFSHWQKCVDCNGSCFD 


12708 


26609 


A 


12836 


7 


328 


rrererererererererererererer 
erererererererererererererer 
ererererererererererarv*ekst 
rykspppripplrr/rrv*rkhpharal 
s f fl ykkctggeraps p vc vggkft w 


12709 


26610 


A 


12837 


219 


350 


pfnhtyatws iii snvqvcfmraqd i yl 
liylfeagscsvaXrlecsdmimlhcsl 
dlpgsnnpp i sat * 1 


12710 


26611 


A 


12838 


189 


359 


LGLDGVMRVGPCRALGPS *EEKSSRVQW 
LTPVI PA/ LLEAEAGGS PEVRSLRPAWP 
TW 


12711 


26612 


A 


12839 


368 


77 


KNPNFLKFGSKPMGPPIYSPP\LEG*AG 
GFIWPGF*TPPGYMGKPPFFLKYSNLPG 
LAAP GGC S P F P GGLGRK I S FT P E ME VS I 
NPGSPLSLPPGEQN 


12712 


26613 


A 


12840 


16 


173 


ktdvhs ktctklftaalfl i vkkwkhlk 
pp*vd/einkmwynhiveyylaiks 


12713 


26614 


A 


12841 


425 


45 


NSFVFFFFLFENRVLTPVAPRLE\CTGV 
IL/APHCTLPALPEFKRFLVPSALLSSW 
DYRPVPVMPWLNFCIFVETGFH\HVAQS 
VLKLVSLKL/PSPTWDVPKC*DFRYSVR 
CCGLFF 


12714 


26615 


A 


12842 


237 


375 


LliLVWYILKKLW \LGWAHACNPSTLG 
GQGGWIL*GREFETSQVNM 


12715 


26616 


A 


12843 


21 


325 


TSFFFLESYSVAQAG\QWCDPGSLQPQP 
PG/ SQ/DNPPTSG* VAVTTG/MHHHARL 
I FVFFCRDGILL YCPGRS KI IFSSCIRE 
LFKKISDFFLLTMLIFCNSNKG 


12716 


26617 


A 


12844 


369 


58 


PISPLQFMLPFLKNQSPYGPFFWKKKNF 
PPPFLGGAVLKTPGF*NFLFKKIQRGFP 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=PhenylaIanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














PFPKKKKPRERPKPCPPKFG/ SPPFFFY 
PGFPPPKKGPPPPFFFFFFF 


12717 


26618 


A 


12845 


211 


409 


NALKPKIHFFFSGGGLKGIVWGNTLLDI 
GLHKTFFF\*GDFYYAL*I*AENALFGG 
GGDCAPHLQVK 


12718 


26619 


A 


12846 


200 


33 


QKTNNTKCWQGWGATGTL IHF * GE FTRV 
QSHWKIVWKFL*/SLNILP/PSSSSSSS 
SSSSSSSVKNLCLHKACTWMFIAALLLL 
AKTWKQP * CE 


12719 


26620 


A 


12847 


416 


125 


KIPTRPGKGGPPLYPRPFRGLNKQIGLT 
P EFGTPLGNKGKS F P F FQKNKN \ NWPRG 
GGPIGPPSPGG*VGGLTLKGEVSTKLDS 
PPAPPPGGQKWPLP 


12720 


26621 


A 


12848 


85 


397 


KIHIFFIFLVSLLKGLSFLLTFSKN*VF 
/SFIDFLILCVCVCIEFYCFILIIiFHYF 
CLFGFCLFLMFVS * CQSLDFYFNS F / YI 
F*YKIQCYKF/CILIF 


12721 


26622 


A 


12849 


1 


246 


R PRRHL VAS L VLTQLNGS S PLPIKIISW 
AWWLMPVIPTFWEAEVGGLLEPRSSRQ* 
* AMI \ S P CP PAWAT 


12722 


26623 


A 


12850 


161 


1 


NKTTSSQIW*PMPIVSTT*EAEVGGSLE 
PRS /L/ KLKCAMIS PVNSHCTPTWTTK 


12723 


26624 


A 


12851 


65 


244 


S T YYLHS LPFLD I S YKWNHTT C \ SRYFM 
GALFL I AI K* KQ P KR P S ADE Q I * T / 1 MW 
YIYTLEYDIiAIKRNEILIHAATWMNLKN 
I MLS KR S QT QKT \ H W* FHL 


12724 


26625 


A 


12852 


235 


34 


TTIAVSVLILKELMEP*TLQPDFLG/WS 
LITES VDHVQWHAPVI PALWEGEVEGLL 
EPRSLRPAWATW 


12725 


26626 


A 


12853 


168 


453 


CQLRGASGTQGPGLSESQCQQPCP/PEC 
RPSKPRPCGCRI*SPARTSPQPSPAASM 
ALPTNERTPPPALKW*PK/ gpkqspqsa 
KS KS P VKS TERTAK 


12726 


26627 


A 


12854 


244 


3 


ELQVYMHI * I YI / C * S I YTKANI YI KYM 
HMC VY I YTHTHTHVC I HLH I YLLRHG FG 
C WLMLG KVRG CEGMAGLM KAQQGWGW 


12727 


26628 


A 


12855 


376 


428 


KWEPGGHTS FFFFKGLVLNFGRGVLQKT 
RP*GGGQGGKFNPNFSGPREPPPP/PPP 
GGGKKGPPPPPGVFVFFLEKGGSPILPR 
GVLNPGAPKGVYTQRGGIKSGNQGATPL 
SFFLKVWF 


12728 


26629 


A 


12856 


129 


395 


APPNTHPFLSEAPQSLSLEARPPSAPSN 
SSSR/PPVWRASQTHPPQDWSSHCSPPW 
GAPPQI *PFSDSLAHPPQDWSSHCSPPW 
GAPPQI * PFSDSLAHPPQDWSSHCS PPW 
GAPPQI 


12729 


26630 


A 


12857 


363 


97 


GLATLRRLVSNSQAQVIRPPQPPEVLSL 
PKC*DYRPGRPPASLIHVNSRQT*KVAP 
/CSKPLAAAAHQGSPGATETTRRCPPSR 
CGPIF 


12730 


26631 


A 


12858 


322 


361 


KSEKQS * VMLAVCTLDMMKMT \ F I S WF 
LPKTHN/LMSNYKKNTRQIPMEGHSTIY 
LTRIPQNCQGHQKQ*KSEKQSQQESKKR 


12731 


26632 


A 


12859 


88 


417 


HFTFFFFFFFFKRGGGFFSPRLKRLGKN 
FFFLDP P P PG * RDFS P S PFKEGGF * KP / 
SPPPLVFFFFF*KKKGFPLGAGGF*TSG 
PGETPPLYSPEVWFFYGGTPGPTPFFF 


12732 


26633 


A 


12860 


217 


2 


LIL INKLANMHCFL FVFETS LT * AGVQW 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Giycine, H=Histidine, I=Iso leucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














HDL/GLLLRLAPPRFKQFFCLILPSSWD 
YRCALLHPATREDEA 


12733 


26634 


A 


12S61 


375 


145 


LSFFFFF* KPGS CS VAQAGVQ WHAHS S L» 
QLQTP \GFSDSLNSAFSVARPVRVHCHT 
RPLSPFNICWRARPKQKVFL 


12734 


26635 


A 


12862 


525 


852 


NLLSIYFFETESRSVAQAGVQWCHLGSL 
QPLPPRF* IKKPSNTALFM*NLLSIYFF 
ETESRSVA\RLECSGAILAHCNLCLPGS 
SDSPASTSRVAGVHHHSHLSSWDYSCAP 
PCLANFFVFF*VETGFHYVSQDGL/DTS 
* PQVI CPSRPPKVLGLQA 


12735 


26636 


A 


12863 


3093 


204 


EPDKTGPVLWKVGGGARVPGMAETLSGL 
GDSGAAGAAALSSASSETGTRRLSDLRV 
I DLRAELRKRNVDS SGNKS VLMERLKKA 
IEDEGGNPDEIEITSEGNKKTSKRSSKG 
RKPEEEGVEDNGLEENSGDGQEDVETSL 
ENLQDIDIMDISVLDEAEID\NGSVADC 
VEDD\DADNLQESLSDSRELV\EGEMKE 
\LPEQLQ\EHAIEDKETINNLDTSSSDF 
TILQEIEEPSLEP\ENEKILDIL\GET* 
RSEPVN*ESSELEQPFAQDTSSVGPDRK 
LAEEEDLFDSAHP \ EEGDL\DLASEST\ 
AHAQS S KADSLLAWKREPAEQPGDGER 
TDCE PVGLEPAVEQS / S VAASELAEAS S 
E E LAE AP TE AP S P E ARDS KE DGRKFD FD 
A\CNEVPPAPKESSTS EGAD \ QKMSS PE 
DDSDTKRLSKEEKGRSSCG\RKFLGLVG 
L S STTRATDLK\NL FRQ I WGRVGGAPRL 
WTNARS PGS FAVYG FVTMS TAEE ATKC I 
NHLHKTELHGKMISVEKAKNEPWKKTS 
\ PKRGSDGKKEKS SNSDRSTNLKR\DDK 
CDRTDDAKRCDDESVEKSK\DKNDQK\P 
GP SERSRATKSRKS RGPKRT W\ MDKS K 
\GVPVISVKTSGSKERA\SKSQ\DR\KS 
ASREKRS WSFDKVKEPRKSRDS \ESHS 
RVSERSEREQR\MQA\QWEREERERLE I 
ARERLAF\QRQRLERE\RMERERLERER 
MHVEHERRREQERI \HRE\REELRRQQE 
LRYEQERRPAVRRPYDLDRRDDAYWPEA 
\ KRAALD E R YHS D FNRQERFHD FDHRDR 
GRYPDHSV\DRREGSRSMMG/SIREGQH 
YLERH\SDPEPH\GQDSLR*LGWGYEL* 
Q * RLS \ EGRGLP S F P PGAGRDWGDTWLE 
DEDDPVMGKGTAERGPMMDR\DHK\RWQ 
\GGERSMSGHSGPGHMMNRGGMSGRGSF 
APGGASRGHP IPHGGMQGGFGGQSRGSR 
PSDARFTRRY 


12736 


26637 


A 


12864 


2 


456 


IHLGSGSEGDGSSGGLGRGNSNTSRFSS 
SSWARGDNVPRPPPPAVCSWISEGDVQN 
PGLGEAGAGS STPGDGGELRY / WPGLLG 
A*GRGIGGDDDDLRTLGLAGVQQCGKG/ 
RRGPRGPGRGQEPRRPRDLGLRGPRRAQ 
GAA/SATAAPPP 


12737 


26638 


A 


12865 


388 


1 


L I KMTKVKKTAITKC* RGWGRTGAFIHC 
SWECKMIQPLWQTVWQLLKKLNIYL\WK 
QPKCPLM/VKWIKKLYISMDSYSAVKKK 
ETLMYTTA*MTLEVALLSERSQTRKGER 
GHILWDS I YINFQKMQTNL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12738 


26639 


A 


12866 


198 


3 


KKQKQTKNP PKI S * ALWCVPWPATWGA 
KVEGSPE\PRRSRLQ*AMITPLHSSLCN 
RVRTRGRTRG 


12739 


26640 


A 


12867 


415 


973 


NPVNCAQFSLLPTTEYMGHRVEGATGHG 
TCP S VP PNTHLHTGWWCLQHHRS CRAWG 
RGGSHTHRCPQRVPDGEHP * I YIAVHGE 
PP/ESPPQPSPLSCCPPQGNYIALREPP 
QGLPLPGTLPSHPPPFWHICKTHSSESR 
HPPFPGFCGLELEKGVGDFI IHPPLTLP 
KPLPNPLKPEPTPTPQTHI 


12740 


26641 


A 


12869 


715 


893 


PCVAPHSIREERFRLTSESTNQKVLWWG 
ID*T/VILILTGNWQKRHLKSFFLGQKP 
GGMAP 


12741 


26642 


A 


12870 


342 


58 


KKKKP KENEQS I WDMWDNFRTS DI * TTG 
VLEEQFRE *GRKI I FKE I VRENSSNLTK 
/Q INLLTQEV* TTT / HKNMNKS TPIHI I • 
IKLMKTTTKEKI 


12742 


26643 


A 


12871 


110 


401 


SFYSVMILKASRIVICLFT*LYAYIHTL 
YI YTLC I YYAYYVNX YICTHI \NIHI I I 
HVYT YYI VYAS I YYS MYS AW YN I KVS KD 
RQTYENIALITYY 


12743 


26644 


A 


12872 


439 


256 


S YFMRFGGDKI S KLYQF I LS RRWKQS TC 
PSMDEWINKMWH/ITIMEYYSAK*RNEY 
SYMRHE 


12744 


26645 


A 


12873 


356 


43 


GTPERHSHSHS QGS P I JGAFMGI RAS F V 
YSLLS CVLLMLS PQRVLFYFS AFKQQQS 
\WSGKVAQAYNSNTYGKAGSIT*GQEFK 
TILGNIARPCFYKKKFLANS 


12745 


26646 


A 


12874 


341 


205 


RE I KKMVLNKKFFS LQFLGKKKNFWEFF 
LKKKKKVFKAP P VFQ F* VP IF/CSF*KK 
KVFGFFFSPKI FFFFFFFFFFETESTS V 
TQE FNGT I LAHCNLGLQGS S S S P AS AF * 
VAVS QD CAI EL LGDRGRL CLE KKKKKKK 
KKNFGGKKKS KNL F FL KR KKMGT 


12746 


26647 


A 


12875 


474 


206 


PIRANGPVFSPPPPPPAPGTCPHPLVIM 
LSRHLRQSHPHTTVQPWTSGLLPSPPA* 
LSLETQHFKTAKFIFP*H*SSGLGPGWG 
SPVGEGPG/HPKLCLQPAAAAEAWPPRE 
GTKARRHEGLPPAPCAWGPAPRDAGSRL 
EFPTLPTIPVSQSPGR*HGAGGRPSCLR 
AFVPSRGGQASAAAAGCRQSFGDLALLP 
LGS P I LGQGQNFSARGR 


12747 


26648 


A 


12876 


257 


467 


YRVFTLLPRLEYIMAHCNLDLGSSDP/ P 
QVAGTTLK* LSRASRVAGTTGVHHHSWL 
I FK/ Y FL * RRWRRFS L 


12748 


26649 


A 


12877 


867 


1409 


GPTVLVGGQDWNSVLQQPRAHSFTPRGG 
AGGLGGGYVLHKPIPFSQELLGAFPCQK 
Q YRVP P PLEAQLLRQARS PAF I Y * AAVL 
CQAQGPSPSPCSPMVSPRPSGRAPHLGA 
APQSGQKALASTS S PDAEPV/ SVG * WCP 
S NP AL QGGQNGAE PRAVS ITS GAL S LYD 
CFLDGWWEGRSLGP 


12749 


26650 


A 


12878 


1416 


740 


IRNPRWVQLDVFAAAPMGRSSPLFLLLF 
SFLKTVLRGSMIPSTVPS/FHKGPQACL 
YSLSLRS*SQAAPSPAPSSLPTPSPHPG 
LTQPGRGRRKADIS/CSHSWAFYVP/SL 
PHLPWACPLSLPQPVLFSILFCFVLF*D 
GVLLCHPG\WSAMARSWAHCNLHLPGFK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














QFSCLSLPSSW\DYRCMPP\GLG*FLEF 
*VEDGGFTILVRLIWNS*PQVIHLPQPP 
KVLGLQA 


12750 


26651 


A 


12879 


961 


31 


PSPVRDSCVAWAGERRGRLGGSGVWRSH 
ARGPGLMRGLPASEPAPCVLGSRPCFRV 
AFETPWTSPASCMFERCPLWRPPALVAA 
GLDP FLLSS TLTLAVCPRDL I RSWKKGC 
D VCD S RNCWS TH S RPACRDAI LAP I LAG 
RGE C I GLWG / AAAACTGRV/ T P VSGLTF 
STVFDFSSARG/V*GGLPRSTGGPRVRG 
PGQPVPLPGLPAGVALAPLEGRPQEPAP 
AL S H S L TGS L FLGAWL CG C TRHGRAP VL 
LEVS I VACGQLCEAAPQI FIQPGLQQLP 
TPSATSPAYLWVDVLLTLPSVIKQKRVI 
SFLTKE 


12751 


26652 


A 


12880 


232 


28 


PAIiWVPWARFDGMEETGQNHWSKEKAW 
PGAVAHTCNPNTLGG*GGH/MT*GQEFE 
TSLAXTMAKPRI PP 


12752 


26653 


A 


12881 


331 


1 


KNGPFFKNPPEFFPPPKWGVPPNPTPYF 
PWPEPIF*LTPGPLKKQKIPFQ\KGEIL 
GWGGKMGNPLPFWKWGGPPRESKSSPRF 
YKTPPPI FKKKKKGGSSRSRTSPRV 


12753 


26654 


A 


12882 


370 


287 


VS ILAGCCGVAMIDNTVLVHKEVKILRK 
* 1 1 KK* CI * * FRGHSK/HNFSG * AWWLM 
PVIQHFGRPR*GSLEARSLRPAWATWQD 
PHLYEK 


12754 


26655 


A 


12883 


656 


1168 


E PSRGVWPHEDAR INGS KKKKS KDKKRK 
REDEETQLD/IVGIW*TVTNFGEISGTI 
AIEVDEGTYIHALNNGLFTLGAPHK/ER 
I ALKPGYGKYLS INSDELW/GRSDAIG 
PREQWEPVFQNQEVRNGGPAEMGEEKRN 
GTKWREDTDHTSFPLFPSTGGQPKAHSN 
WRKVCH 


12755 


26656 


A 


12884 


205 


452 


ASGSAHRFRTFSFETGRVLDKPSQTDHT 
NRKRGGPNVK* SFL/ WLGAAAHACNPNT 
LGGRGRRTA*AQEFRTSLGNTVRPCLY 


12756 


26657 


A 


12885 


112 


290 


KYNI I Q YNI VFS CHDPLVS FNL FFFF FF 
GKGGF P L S PRLKARGR VWG YLN P P L P GK 
RNFWAPPPQGFEIKGLPPP*SPPFGGVG 
QAKNLGPGFQPPPAPMGKTPFF*KNQNN 
PNWGGQ PLNFKTLGGGGP KI S F / PPGGG 
GLNNP KP S PG P S T WGKGGN P L S Q KKKKK 
KD 


12757 


26658 


A 


12886 


406 


190 


TRS LARGGFGR P LQ KP YWNE Q S F I MAKK 
\YEQPKCPWTDKWWKMWSILTVG*YAA 
MKRRDVLTCATITCR 


12758 


26659 


A 


12887 


440 


160 


RVCVCVCVCVCVGVFVCFALWLSSLIYN 
* VYLFLFVHNLGRVSFSYFLNLI * FFEH 
VNHV/HNYVQMIQNTKLYLKSYLETFH/ 
CPC*FPPILLH 


12759 


26660 


A 


12888 


201 


411 


HIWDVI LIRHALGNFLTS IS CYI VKIWL 
DAVAHTCNPNTLGGQGG/WIT*SQEFKI 
ILGNIGDPLSTKK2STK 


12760 


26661 


A 


12889 


138 


464 


S CTMNP S EMQRKAP PWRWRHRS * APSTH 
KMNRMVMSEQMKLPSTKKAEPPTWAQLK 
KLTE LA/ TKKKS LENTRGTQTS KNMLFA 
ALMI VS TGCAGVP S S S KETAT I EDKP 


12761 


26662 


A 


12890 


146 


366 


FS VSMKYFV* IH* FYHL/ KLLLMVYL* Y 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GFLAINTS QN I S CS QKIM KLHL * GRGW 
AHACNPGTLGG * GE WMA* AQE FNTGL S NT 
TAKPYLDF 


12762 


26663 


A 


12891 


368 


50 


KGRQSGVS S F P VGT VTNYTNL VAHNNRY 

lfphspgcqtsevkvtMgcsllkasrg 

EGFFPLPGSGGSKCPLAYGCITPISASS 
CDLLLWVCVFSSYLAY*DTCHWI 


12763 


26664 


A 


12892 


227 


456 


KLTTLLKLFE K\ I EDRTLS KS FYEVS I T 
*IPSSSSSSSSP/ISLINTDAKKINKIP 
RNRIQQCIKKIIFHKFSTGIYS 


12764 


26665 


A 


12893 


461 


89 


I YQNLSRKPDNI KCW\ *RCTE IQTLIHC 
W*QCKIMQLLWKI IWQFL* /ELNTEVPY 
DSIPLLDMY/PKKTEC/YTSTQTLIYCM 
NPIEYYSAIKRDKVLIHATTWMNLENI / 
ISKRRQSQKSTYCMIPLI 


12765 


26666 


A 


12894 


222 


185 


RIVMQYLHLNLGACVCVCVFVCD/ CVCL 
CV*MCVHIHIYISTHTYIHIYVCM*ICR 
LS**CEFYS 


12766 


26667 


A 


12895 


186 


166 


I FFFFFEMGSHS VTQARVQWHNHSSKQH 
PPP\GASDPPTSAY*VAKINIFFFF 


12767 


26668 


A 


12896 


55 


485 


TCWDCRDEPPRPAIiESVFLTSFIiEDEFS 
IEIADLVPHPTPCSHPLQPP**VIIYFC 
S \ KPLVTGS S S P PRKP FPS LHE PKSAS L 
*PPSSTIQPVRSSPKG*PSPPLA*VQFP 
KGMT I PTLCLS P VL P VPST I PLPNVS KP 
LPFY 


12768 


26669 


A 


12897 


923 


312 


VD V I RKVQ I E I TLRCHL I HVR I A I I KKT 
RANKCW*GCGEEGAIiVHC*WEC/NYILV 
QPLWKTVWRFLSKLKIELPYDPAIPLLG 
ISPKGNEI 1/ CCTPKFIAAQFTIVKIQT 
QPRCSSMGKWIKKL/W/HIERSIDR*LE 
Y YLAFKKKAVLS FAKTOIDLED IMLNE I 
SQTQKEKYCVISLICGIKKKKVKYIEIE 
NKTVI TKVRIRWRK 


12769 


26670 


A 


12898 


320 


3 


ESWRWGRIVRPHSVPGETWLCPVPPGPP 
NAPDGSFPLAICPVPPSCRSL/CPP/DD 
CLASAGV*GGFPLPRTEASPHHSWACPQ 
EQPWPPQCKLKVGAPNSTALSEGS 


12770 


26671 


A 


12899 


138 


447 


I FVFHYKTQNI YNEDTLHVI I INLWCYP 
GS YI KSTGRLRPE VERGLGPTTMCC* FS 
S IKNI FYF* KLKS YRKL/WLGTVAHAYK 
PSTLGG* GRWIT *GQKFNRD 


12771 


26672 


A 


12900 


279 


1 


INQSIHNNIKSTYISQCKLIQP*WKIIW 
RFIR/NLKIELLYVPATPPVGV*LKELK 
SAC*RDNCTT/M/FTAALFCIGYGSNLG 
NSTALIGSRVLFP 


12772 


26673 


A 


12901 


235 


183 


I INYGGGP PEKKKKGGKKI PL I FYKKI K 
KFLWPKKKKLFGGAKNPI V* AHHI FTV* 
IDVDTRADFTSATI \ I IAI PTGVKVFS * 
LATLHGSNMK* SAAVL * ALGF I 


12773 


26674 


A 


12902 


180 


19 


LFIYLFIYLFICEIRCHFVTQAGLQW\L 
IIASCSFALLGSSDPPTSAS*VKGI 


12774 


26675 


A 


12903 


406 


949 


APVPGHGFCVSF/ CGENGVSLYCPGRS * 
TPGLKLCSCL\CLPESWDYR/R*APVPG 
\LGFVFHLTL*KPPFFKDYMKSLFQFLK 
YL IQG * CTL V* GVR YR SSFIFFFF FLRR 
SliA/SVTQAGVQWRDLGSLQSPPPGFTP 
FSRLSLLSSWDYRRPPPCPANFAFVFLV 
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Amino acid sequence (A=AIanine 
C=Cysteine, B=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, LHLeucine, M=Methionine, - 
N=Asparagine, P=ProIine, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 














EMGF \ TVLTRMVW I S * P CDP PASAS QSA 
GITGVSHRARPKFVLYFKDNGEPLGGVL 
HRSHIAVHWM 


12775 


26676 


A 


12904 


265 


385 


GNGS P S LRW I GVLRGRS PTLELRH *PIF 
LTAASRRSLDRVSVPMWGTFLSEPLSIE 
GLVSRY LTNNLMER I P I L YRN/ PFNNET 
M/PESLCYRVLIFLSKGYPRVKGRLDTC 
YSPWRSPAKKASFLPDAPRLACVKPVA 
SVHPEPGSNRTRG 


12776 


26677 


A 


12905 


363 


137 


APGMQA* WCM/HC I PGAQEAEAGEWTEP 
RSSRLWCTVIMPMSSHCTPAWATW 


12777 


26678 


A 


12906 


315 


565 


TPVDHGWLECSGAIMAHCSLKFLGSRDP 
PASASRVAGTTGVCHHSQLI FLFFVGTR 
SNFVAQAGIELLG\ *VILLPQSPKVLGL 


12778 


26679 


A 


12907 


376 


412 


DAWPTWRNL* FGVYVMTDR* EPRQMLTY 
LTA\ PLKYFKS GVMAHS CN\ PS TLGGQG 
RWIT*GQEFET\LANMEKPCLY 


12779 


26680 


A 


12908 


215 


2 


NIHINGQKLFL/WLGWAHTYNISTLGG 
RGGKI S * TQE FETS LANT VRPHLHKKQK 
IALPLPLPLPLPTVR 


12780 


26681 


A 


12909 


307 


379 


KR* KQPQ YPLTEE WINKMRHV \ LQYYS I 
LKKKEIL 


12781 


26682 


A 


12910 


279 


123 


KMILSWARWLMS I IPAAWEAEAGGSVEL 
RS* \LRLE*AMTVPLRHFTPAWARE 


12782 


26683 


A 


12911 


1 


194 


AKQH I PLLARLE CI/ GLFTNT VTAHCS L 
ELLAS S HP PASAS * VARMTECVHRAWLE 
WIFYLCVIL 


12783 


26684 


A 


12912 


206 


388 


IELSKQGPVYYKWYVRITPPYFPFFFFF 
ETGSCSVT\RLECS/GTITAHCSLNLPD 
LSYSPTAAS*VAW 


12784 


26685 


A 


12913 


238 


426 


TPLLPSQQWLKKESLWLGRVVHACNPST 
LKG*GGRDS\RGQ* FETSLGNIARPCLI 
REKFLKI 


12785 


26686 


A 


12914 


115 


1436 


AKDRHFTEV* IANKQMKRCSTSLGMREM 
QS KI VRYHLiSE * /R* LFF FNGGNCWQG / 
CKK/IQDLIHCWECKMVQPLWNIIW\RF 
L *NHTCI CHASGIALLS I YPREMKTCVH 
TKTCIRMFIAALFVMVETWRQPIYPSLG 
G 


12786 


26687 


A 


12915 


282 


21 


I KFP P C P ST WGTKKKLFPKKKKE VMKML 
CFHFNLLVQN I F * YRLGNE I SDKAG\ P V 
AHAWNPSNLGS*GELIT*GQEFDTSLAT 
IVKL 


12787 


26688 


A 


12916 


254 


150 


T L P GRS S D FS L P E V/ YDKKNXnFH / WLGW 
AHSCNPSTLGGKGGWIT*GPPHMLKVSF 
LPTDPTSRVKQLYNWVPHSTVLIALFPT 
KNPG 


12788 


26689 


A 


12917 


181 


24 


AQWCDG I P VS KGNL / WLGMVAHACNLS T 
LGGPGRW IT* GQE FE TS LANE FHRTD 


12789 


26690 


A 


12918 


75 


298 


LFILFTS*VD*MRPTHIKENNL\LAQST 
NPNVHLIQ/KHPHRHSQKRVWP1TVWAPH 
G\QSSCHIKMTIMARPSGSQL 


12790 


26691 


A 


12919 


251 


9 


KFMFLARCGGSRL* SQLSGRPRRADHLR 
PGVQNQ PGQHSE S Q YLPKS TNFCRVWWC 
TPWPATREAEA\GESTEPGRQRLQ 


12791 


26692 


A 


12920 


198 


211 


RFCFS VGDEREGKRA/ RERNRETERKKQ 
TEDTETQREERDRDRKR\ERETDTERER 
PREERHTEREKK* KAAFPFQTLGCRAQD 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FPKQLFPSRGWGVGPMTVLL 


12792 


26693 


A 


12921 


438 


127 


R WMER I FVRPH I KDSMWWVRS TLLGRG 
DALNDLFLYCWEKEDPRYKERP\RERER 
ERERERERDRCWEK*APGWRERPVQSLR 
ACHPPWHRFALAGPVPSAQ 


12793 


26694 


A 


12922 


158 


460 


E AR VE XjLi L S S RC KP RAV I TAE T QPAVYR 
LtQLEIENFPGLSPRLECSGAITAHCSLD 
LLG*SDPPNSASR\AGTTGMCHAQIVNS 
CKADWLVLLSSRGLTHR ; 


12794 


26695 


A 


12923 


33 


484 


GIQAACIPGLRLSPPADAHCHRHIPSPP 
PSSTAPRDDVGPMQITPDNFPPQDPPLX 
TPPSQSLtLPHRAP * PQGSS VDLPGLLFK 
LPQMPQTPGPEEVDTCVQIHKTSKARED 
VQMR KT PMARE GRHRAArlRW I GALGKAP 
QSQRRLRHTVG 


12795 


26696 


A 


12924 


335 


332 


RI KVYVRGHAKE * LRPGP / VAHTCNCHT 
LGGQGMRITQAQEFETSLGDIGRPHLYK 
KKKKKGGPF 


12796 


26697 


A 


12925 


302 


407 


AQWLMAI IPAPWEAEAGGLLEPRS* \AW 
VTW 


12797 


26698 


A 


12926 


766 


244 


RVPRL PPPAL * LEFS PSGLRLARSHS FQ 
LRSSPPPVSPPST/PFSPHRSPPLSLLS 
LSVPS/PPLHHSFFSQRAPHILPPPPLI 
FPSSPPPAPPPPPPSSSSPSSSSSPPPP 
/PPINWGTPRFFPPPPFKNPPPEINFGG 
PKKKKIFSPPRA*KFVFLKGPPPFFFFF 
FFFFLGINLL 


12798 


26699 


A 


12927 


127 


720 


WCGLLESTELQLQSLPGLYVSVIFGKIF 
YISGLGICCTKIFPSALPSAQPAASLPL 
PARS ALG I VFFLHF C * I E / CNYKKLP I F 
HHI * L I KTFS YGLT * LFFFFFKTKSLI F 
S PRLNCRGPFLVN * NLCLRG * GNSPA* P 
FGGAG I EGAPHKAGIa I FGGGGGAFLKKT 
G\LHHVAPGGASNSGTGNISHPPLNPPK 
GAGE 


12799 


26700 


A 


12928 


308 


49 


KEHW*AVGHFNICVTGGP*GERKKGTER 
VFKE I I TPNFPNL I QT I KPKTQE TQHT P 
SRRNTKI /TPRYI I IKFLKTSNKEKILT 
TAR 


12800 


26701 


A 


12929 


375 


2 


GCiVILSLKTINILEVIiANVLAQKKIKRRK 
LND\EQADTTFTI * FCVRQKTKKSTDKL 
VKL I / E RF YKVAT YM INFLKKSNS FHT P 
ATINC * KQ I P F I MATKT I KYLR INPRKK 
CVRFVGKKLTFPER 


12801 


26702 


A 


12930 


202 


379 


S S ENF S GKT VNL C CLGRLHRM I S WLHT V 
AHACNPIT\GWIT*GQEFETSIiADMVKP 
CLY 


12802 


26703 


A 


12931 


209 


376 


TVGEKMYLKVKRI \ WLG WAHASNPNTL 
GGQGGWIT*PQEFETSLGNMAKPHLYKK 


12803 


26704 


A 


12932 


280 


31 


I S CKVTKKVTLFELKYI WNKF YI YERVS 
I VGTTE DAACE VT S NVI TG P GAVAHACN 
PST/RGRWIT*GREIjKTSPANIAKPRLY 


12804 


26705 


A 


12933 


1 


189 


QMYTE SEP* CKLWTLGD YNV* M * F I S * / 
IQCTTQAGNVNIRE/RLCMCSGGEYMRN 
LCAFLSILL 


12805 


26706 


A 


12934 


108 


409 


HHSHLLKLPNVWFSFCHFIVTVFLSL*R 
VFFVFY*IFLSSSHFLFSFCFCPIMFIM 
FPFKFIL.SCNIVFI *NI IFFIiF/VHV*G 
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C— Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














*STLVSLFVFFFFLLQF 


12806 


26707 


A 


12935 


225 


11 


ILLCMEPS FL F W FG F I C F VKTGS H Y I AQ 
VEVLWLFSGMIKVHCSL/KLLHSSNPPT 
SAS * VAGNTGSS YCTW 


12807 


26708 


A 


12936 


74 


12 


GTTHASAGVIN I LVF I LS VF FLTHSLC I 
HIS KYMF VYI C VC I CTRTD F YTE FKS F C 
TFCFFGSS FFFLE KKFPLVAGLEEQGKD 
LG*LKPPPPRLKKFSCLTLPG/AWE*RT 
NG 


12808 


26709 


A 


12937 


166 


311 


PHS * WE CQMVRAPALENWQ FLTKF IRGL 
PL\DAALPLHM/CHRKSCTQLFI I ALL I 
VAKKCNHHKNL * ANE WINKM * Y I HTTD Y 
PLAIKQNEILIHATTWMNWTL 


12809 


26710 


A 


12938 


51 


296 


LCMYTLHTLYMHTCIYAYICYTCKYIHR 
Y/LHT* I PEDANS VKHKYTHTHI YVHTL 
VYPHTTYLMCSYIYSWVCVCVCVYIYT 


12810 


26711 


A 


12939 


426 


28 


KGFNL VN P RVF P S PR F * T P AP VF Y FGGP 
IKKIFFFKAGGVKFDSLKRAPLFFFFLK 
TGS C F VP \ RVE CNG 1 1 TAH CNL CL PRS N 
DPPPSDLQVTGVKPPP\AKLFFFFLW*M 
GFHH1LDYQGETCLSQLLGWR 


12811 


26712 


A 


12940 


11 


213 


ATAPGLF S FFE TGF CS VDQAGVQ * HDHG 
SLQP*PP\GSSDPPALASQSARITGVSH 
CKGPEHTSLLT 


12812 


26713 


A 


12941 


416 


38 


WLNNHSRLGFPNCWDYICKLPRLGPDVA 
SLL I HNS S \GTW YGATKLE SNW YFS I P I 
YKDHQK* FAFTWQGQQYTFTVLPQDYVS 
SAL * HNTVHRDLDHLDNPQN I TLVHYS D 
DTMLLDLMSRKYQAL 


12813 


26714 


A 


12942 


361 


74 


I IHPS FSVS SRNIHKNAYPAALGGYSVY 
GVAILLFHYFINKLAFTLHCGLA/LNSF 
LR* IQEPPLG\SGSGPFSCNVFLAPTEG 
TIVQKPDPMATFG 


12814 


26715 


A 


12943 


125 


401 


SSFEPRGIPLNPEVAMQKESVNILCSPR 
SQEFLESRIKIKVLTDLTQDEL*GQAQ* 
LTPIIPS\IWEAKAGGFLEPRSLRLRSC 
HCTPAWVTE 


12815 


26716 


A 


12944 


92 


306 


KRTTNSPWGCSAY/GVAILLFL* IL/KL 
LAFTLLHGLPSNSFLRKIQEPSLG\SGS 
GP I FGNKLVAFGGLM I WS 


12816 


26717 


A 


12945 


322 


24 


NFKKN I NLNPFLRGYFL FLKGGFGLTFK 
KVGEPLKKKKKKLNQQFSYFLSTYTVK* 
NN*/WLGAVARTYNPSTLGGRGGWIT*G 
QE FGTSLANMVKKI LAK 


12817 


26718 


A 


12946 


2 


448 


GGAPMKRHGSLDRNYRVDTTRPSQLFLF 
QLYHN YLKFAYEATKE YMHRKETL CLPC 
PAIQFS-FPKSNLCYRGYLCMRTCAHTQM 
*MYRYIYTWYMHIYICM/YRAICTYYT 
FE * KYILQKCILLCI FLFFLQYLKYVS Y 
LNRHHFL I AA 


12818 


26719 


A 


12947 


155 


415 


LLETGGNAGCSSWMHVLLGKIHRYGMGS 
RPISLMAG* \ GP P WLRRVAGLRGR PATL 
GLRYGPDSYGRQQW\EYCTMGASLMQRR 
RVRD 


12819 


26720 


A 


12948 


247 


250 


KYVYKY IH I VYKCVY I FTHVYGYT YL YT 
D VYKYT YL YTDV* I C I C I YKC I YL YW YI 
YIHLCINMYI YIHLC INMYIHIHV* IYI 
HIYI\HVYIFIYIFLYTVYK*IHIYEVC 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, A=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














EYIYTLWKL 


12820 


26721 


A 


12949 


25 


407 


E ALS L L LLL VLWGLLLG I AR KM YQQMNL 
VQYCLLVL*P*NL*EQKRFEREVGLDHR 
EP*FHVLGCIHSEKSFRFWSW*EL*IRK 
LTLVA\GGVAHTYNPNTLGHRGRRIT*G 
QEFETSLGNWRPLFL 


12821 


26722 


A 


12950 


370 


106 


I L YLETLIiKS FISIRS FSAVDSHFE KS S 
TVGKMLPNS ITCTDP I \ FNKKKSQLMWQ 
SSLLPFSQKLPQSPQPVSQQPSPSRQGS 
I PATR* QLA*GSVQVMLFGS ILPTVELF 
SKWESTAEKLLILINDFSKVSRYKIYV 


12822 


26723 


A 


12951 


280 


319 


D Q PGQHGKTL F * VS H * KKKKGWM / PGVY 
AYNPRFLGGRGGW IT* GQGFE TS IiANMG 
KPCFY 


12823 


26724 


A 


12952 


337 


113 


IQLAISLLGICPRDLNTVTCS\SDTCTQ 
MFLS VLLM VAKR * KQ P KYLQ I DE W INKM 
QCIHMESYLVSKGRKYLCKL 


12824 


26725 


A 


12953 


9 


186 


DPTVCCLQDTHFTYKDTNKLKVMGWTKY 
/ CHANGNQ KRAGW I Q I EYP * S KS LKSE 
IVQN 


12825 


26726 


A 


12954 


46 


381 


NENTYSYKKMTQMFIAALFWTPNWKQP 
TCPS/SGEWINKLL\YHLMVYYSAVKTN 
/ ILVYVATSVNLRI I ILGKQPDKKRVLT 
V* FHLYRKYKLI S SDRKQIHGRLEVENK 
GK 


12826 


26727 


A 


12955 


19 


250 


CS DMVLKAC I KKLMS YS KMGH I QAMD Y Y 
TAL/NKEL/LMYATI *ANLEDIMIiSKIR 
QTQKDNCLMVDC I Y I RYAEQ S S L 


12827 


26728 


A 


12956 


173 


163 


GAKHSASGTGWIRKDFTRPGAVAHACNP 
S TLGG * GGWL SRSEFETS LTNM VKL / CL 
YKS I CCTRCLLGCFPQALSKAI D 


12828 


26729 


A 


12957 


301 


79 


EKDNQPLLTPRNCNVLHAIKESENHRIT 
ELKVTLRPDAVAHTCNPSTLGGRSG/WI 
A* GQE FE I S LAKKVKP CL Y 


12829 


26730 


A 


12958 


314 


126 


KHHHFKKHNFRVL C VCVCVCVCVCYKHF 
YLFSLSFSQ*P/CICC*CVCCSFLLCCF 
*SATYLKII 


12830 


26731 


A 


12959 


529 


1697 


VPFMGHISNFFFSPFLRQKYLALLPRLG 
VQWQ*SRAHCNLRFPG\SSNSHASASRV 
AG I TGACHHGWLL F VFL VE TWFHHVG \ Q 
AGLKTP* PQV\SACFSLPKCCDRRCEPL 
CPATSS I LNL * TMDRL / PGE P KNQ * NLL 
VSFMCAVFYREGQQHVYFQRSLWCKREF 
IVRSPWAINLEKGGLRMGAV*IRIMLNL 
TLT*GFCTEDEKTVNSRVNQKLTPT*LG 
DE * D VKRE KNI ND E I KLLT CNS E YTKS Y 
RLGR* I KLGAFCF FM \ VPMGQTGRNGQL 
KVQV*TGTVAHVCNPSTLGG*DRRITRV 
QE FETNLGDHS ES S SLQKI KLARHGAAR 
L*SLSL*VSWDYRCSPPWP\SNFCIFK\ 
RDGVS P YWP KA/ WS * TPDLQRS AHFS L P 
KVLG 


12831 


26732 


A 


12960 


1 


395 


GTRE FR I L S E KFKKE I D F F F FFKKQS S P 
P PGP KKQTL / C * KQTTPTKPQKTNLGP K 
KFGGQPKKPFKISHPQIGPPKKKNYNP* 
GQ P VWNP PGE KKEG P TP P KFKD P GANL K 
VMGLKGGVKRKRGGKTFPQRA 


12832 


26733 


A 


12961 


90 


436 


LWRLTSPKSAGWAGNLETQEESMLQFKS 
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X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EGHR/ PAESPLA* SSVLFSLSTDWMRP I 
LI/MEGDLLYSKPIHF2STVKISSKNTYRN 
IWN1WWPHV* TL* PSQSD/CLK*TITP I 
VLLTCTH 


12833 


26734 


A 


12962 


415 


463 


LRAKDQ VLP K I L VC * TL L FF CAQ I KKKK 
K/WPGAVAHAYNPSTLEGQGGWIT*GQE 
FKT S LGNMVKP S G I P PHW I 


12834 


26735 


A 


12963 


774 


316 


SIS PT CSGKI GGKLNRHFS KEDKTG \RY 
MNKCFISLVISECYLKP*EDTH*TP\LR 
MAKI KRADH / DKC * QRYKMTGTL I QC * R 
ERKMVQPLWKTV*QFLKRLNIHLPDDS I 
PLLG \ I Y * RKMKACVHTNT 1 1 WMP IAS L 
LVKAK\AKKQPKCPSTR 


12835 


26736 


A 


12964 


44 


395 


MYFTIiVWGSLGPKFLGGHQPLQGQGPFL 
HFLKAL* *FPFPQTPGNGGGTPPAKTFP 
PFPTPLSSSSPCPSPAEGGSREPSLSTP 
/SCLHLYIGGAASNPSPPSPLPPPQCSG 
LGYPVC 


12836 


26737 


A 


12965 


424 


50 


MAFPFLSCAPDAINNNNKKCCREINNEE 
PPNE P LKWRH I M I F F FEAE SHS FAR \ LE 
CSDAISAHCKLCLPGSRHSPASAYRCPP 
RHLANFSAFLVQTGPPRVSQDGKDLQTS 
*SALLMPPLDPAIP 


12837 


26738 


A 


12966 


245 


44 


QSKELGNYPLPSPSPSSPSPAFHGLPPP 
SSPSPAFHGLPLLP\SWTVQPQ*RLTAT 
SLPDPPASPRPG 


12838 


26739 


A 


12967 


205 


30 


Q EGAS LS R I KRG PGVGAWLGMVAHACNP 
STLGGQG/ SWIT*AQEFETSLGNPHLYQ 
KYKN 


12839 


26740 


A 


12968 


114 


465 


DPVSNPETTKPEPPKKTTES * P/ EPKSL 
LWLPS \ P *NSS * CANREQQPPS PQPHLP 
HPLPTSLQVHRLP* *SLPPPN1>FTPLYD 
/SPEPSLLPVIQFPA*TSPP*PPVYSSP 
PSPTCPLH 


12840 


26741 


A 


12969 


1618 


1038 


VHMVQD KD I NLNNQ FLSGTMLL F FKE T S 
HR*DRGPGDF*SCLYSAPIPTQPLLF*Y 
LPQNFIEG* PS *NSRI KGTSLKTYISRV 
PSFFFSFL* DGVS \ TVPQAGVQWHDLGS 
LQP P P PGFKRFS CLS LLS S WDYRHPPLC 
PANFLDF* *RRGFTMLARLVSNS * PCDL 
PASASQIAGITRMSHHTLPHVFPLNV 


12841 


26742 


A 


12970 


197 


411 


EQALRSTVCYCLNPLPPAPLRPRQDC\G 
PCRFRWMPGGRTRWLAPVIPTL* EAEAQ 
VWLE ARS LRQAWATW 


12842 


26743 


A 


12971 


616 

t 


777 


MGIHGVRHNAQLNFVFFVEDGGEWQLTC 
TGAGD S S WVLGE P * VFAC/ P KNVIi F YLH 
P*LFG* I * IYQLKITF*NYEGMVP/FVF 
RFLFETRSLLSPQLECSGTITTHCSLEL 
VGS INSS ISAS*LDGNT\GVRHNAQLNF 
VFFVEDGGEWQLTCTGAGDSSWVLGEPL, 
RYYKERLIKWYL 


12843 


26744 


A 


12972 


222 


389 


VITEALGSDELEGRQLWW* IKIHSQVQW 
L I PVI PAL / LEAKAGGLLEAKTSRPAWE 
T 


12844 


26745 


A 


12973 


425 


145 


QIIFFLFQKIKNQFNKASFFKKMTLEQM 
TSRQKNSAQISVFKKKNSKWITDLNVTH 
*TIQLTGKN\NIGKNLQDQGLGKEXLNL 
TLKAQSTKG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Ij=Leucine, M=Meth ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12845 


26746 


A 


12974 


322 


411 


NNKKI FMSTS * KFFYCF I VQL I FET I LS 
raVLESIILINYL*YCIICIFPFLFL*T 
*EI RKVTKFTS FPQIEEK* KCRR/WLGT 
VAHTYNPSTLGG*GGRIA*AHEFETSLG 
NTGRIiPSVQKIL 


12846 


26747 


A 


12975 


428 


1 


SRKSSSQFRLSFFPSHFLDLGAKGAVSQ 
DRAIALQPPGPRSR*S*RPPSRPRCFRR 
PRPSRSRSNKERAGEKRRPRRSLQPRPP 
HIAGP/VPRRQGSPSAGPEAPLATDEPP 
PQRRS G I RGGS AH IRS KQT S S LRPRVRG 
RVG 


12847 


26748 


A 


12976 


60 


294 


NHCTETVLFQ*LTCVCVCVCVCVCVGVT 
ERl/CFFKFLIGKGTYGCIYLWPKTFIL 
IFI YILYSPRGRS ILKKKFFFLT 


12848 


26749 


A 


12977 


6 


341 


DSLTKLPGLECSGAISAHCSLCLSG/ FK 
QFSCFSLPSSWGFTG/VPPRMPQLIFPY 
FSVKTGFH\HVG * AGLKLL IP PAS AS PK 
CWDYRREPPRPGDLWNF*PTGFKLRFPQ 
FPL 


12849 


26750 


A 


12978 


296 


42 


SQPASLGFKSVDLPLLAVCPLGKITLTL 
WDFSFIIYIKQE*WPRAGIFMPY\NPST 
LGGRGGR I S * AQE F KTS LGN I VKPH YL V 
K 


12850 


26751 


A 


12979 


20 


373 


KLYGG I WGF FFFRRGLNS ATQ AEMQGHN 
LG * LKS P PPG\ * RNFPAS AS GE PGI TGS 
HHKAGIVFYIKKKPGLKTFTLGKPTPPP 
S FRHWHQ PAPMGVYS EILGEKSL TNG WD 
PQTKKG 


12851 


26752 


A 


12980 


380 


96 


PVFPP PWRAQKGGS IGRE IRPPRGNQGK 
PGPP*KKKKNYPGGGGGPPSPPFPEG\* 
PKKWHNPGGGTPP*TEFPPWPSTRGGEK 
KPPSKKKKKEK 


12852 


26753 


A 


12981 


331 


71 


ELLIYSWVWWCTPWPATW*GRELVGRI 
T\KSRRLRLQ*AKIVPLHSSLGDRVRPC 
PPPPTKKKLANLCPWTEGSLSFSYLLEQ 
FHSC 


12853 


26754 


A 


12982 


383 


189 


RRALKS CAHS T PAGP KNAALS QAQWLTP 
VIPTL*EAEAGGSLELRSPRLL*AMIEP 
\CTPAWVTE 


12854 


26755 


A 


12983 


144 


6 


I LAQ * KT WLS WAHTL * S / TLGGQDGR I 
TGAQEFKTSLGNTRRPCLYK 


12855 


26756 


A 


12984 


341 


509 


ETLYTME*YTAIEKNEIMSFAGTWLELE 
A\VIL\SNLV*EQKTKMLPCSPLMGWEL 
K 


12856 


26757 


A 


12985 


387 


125 


DLPPTPPKLEPRTCSVAQVEVWCL/GSL 
QPQPPGP\SDPPA*AS*IAGTTGTRHHA 
*EMFPF*VYFVQTSEHILGINPPGRHTK 
VAWEV 


12857 


26758 


A 


12986 


151 


350 


GRGGF* PLP* IFPPGQERPGFTPPKAPP 
PLGFFFPG/RFFFFLRRSLALSPRKWRD 
LGPLQPPPPPPTPGY 


12858 


26759 


A 


12987 


251 


1 


RMSLLEKRRSNN*EARV/ CGRVIYVATE 
VIKIMIGISMVAHAYNPSTLGGQGRRIA 
SAQEFETSLGNMTNPCLYKKYTHKKSRA 


12859 


26760 


A 


12988 


320 


3 


TPWGVKDFSPPPPPGEGVFPGRAPPPR* 
KFFGNFWKKRGFPNWVRGVLKFWG\KNF 
FPPNLPKSLGLRFKPPGPP*KAPPFFPL 
FFFFFEMEFRLCCPGWSAVAQS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Giycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


12860 


26761 


A 


12989 


353 


183 


E YGTH I LTAAI FS I AKRWAHR * G S L TD E 
WRKNSSLQSMEYY\ SAIKKNEVLLHAA 
P 


12861 


26762 


A 


12990 


377 


2 


LSIIL INFPRFSLQRGKYPGP P PPKKNP 
PQKTPPRAPKKKKPFFFFFFFSFFKKNF 
PFFFKGQSKGGFFGPP* PLIFRFK\NPP 
PPPPQKIGFKAWVFSTPPPPIWGFFFFF 
FLRWSLALLPRLEC 


12862 


26763 


A 


12991 


134 


298 


PPNSYCDGSITLIPKSDKNITRKEKYL/ 
P/IYLMTIDAKVFNKVLPNRIQQ* IKRI 
I 


12863 


26764 


A 


12992 


892 


171 


QT*SH/SLSRLEYSDTIIAHCSLKLMGS 
SD PPASASQVAKPTGHCTTMPQPHTEGF 


12864 


26765 


A 


12993 


269 


2 


GLWCFVKQLKLTETCPHFAVPQFPLGIS 
KIPISILFFKKLLL/WPGTVAHTCNPST 
LGGRGGWTT*GQEFGLMFHHWPSKFHHG 
QHGET 


12865 


26766 


A 


12994 


324 


4 


S L YPLL FQAE I YFL VT I HRS S ME S C I Y F 
F3 WVPLDSLVQIHVKI /HIKGIGML* V 
N*QNCFKINISE*L*YIWPSWADTLGC 
QGEITRAQGFKTSLSNMTKPHLY 


12866 


26767 


A 


12995 


106 


531 


WLLNLFAFLNDKGFLVPMNRSRI FSQQK 
VKQRS FEGCSS FNDVGS C YVTQAGEQWL 
FTGVIGTL*PGTPG\SSDPPASASQAAG 
TTGS*HHIQLNLILYSSG*QTI*LIi*TT 
DSTHLIFASLYAKEQLITFYFQATSLQP 
CT 


12867 


26768 


A 


12996 


1 


367 


NTEPFLNI FFKDKVS / CS I AQAGGQWRL 
YS RCSLKLLGS SNS PTS ASLVAGS KGRH 
HYTRL * LFF YVSRPQD YLPWS PVFFFS F 
FLSFF FWKGS F FLP P S WKARAP I W VNGS 
LPPRVKPIFLA 


12868 


26769 


A 


12997 


3 


379 


YRPSPSETKAVLVFFSFFSFFFLGTPGG 
KGPPKGAPNPTLTWRGKENFFI * KKKKP 
/RLI*KKGPPPKMGFKRERFWVLKPKGP 
QFGKGEKG/SPNYPHPRGDPKAPSLWKK 
PQFPGGGGGGAPKKAS 


12869 


26770 


A 


12998 


230 


409 


S CR P KKLLAFLKS S TE I S QR W KQ P I Y I / 
PSTDE* INKMWHIHTMEYYSALKRTKVL 
IHATT 


12870 


26771 


A 


12999 


310 


343 


DPVCTII*IICDIINL*IECSKW*SLKF 
YKILILFNVCVWPSVGNCHKWLCDCW* 
KQW / LWPGMVAHTCNP S T VRGGGGR I V* 
GQEFETSLGNIVRPSLYNNKSCI 


12871 


26772 


A 


13000 


390 


289 


IKLCQPRGVKLDSFKRAPPFFFFFETRS 
CF VS * AGVQWCN/ LGS LQPQTSGPNNPA 
TSA* LSL I FMCSNS YVC I VK 


12872 


26773 


A 


13001 


1 


289 


GI LRQ I S VNYLP VGRS LAQAGVQRCDLG 
SL*PLPPKVKQFSCVAGTTRGCHHAWLI 
FI FSREG\GHP VKGAS SDS PTFGFS KAG 
I TRLS PRALACT 


12873 


26774 


A 


13002 


359 


105 


KQKKNPSPPPGGGGGQGKPG/PPKPFPN 
SRDFPPKNQSPFFPKLKKKFFNFY*KAK 
NP*KPNPPLKKGKKPKGFFSPPPPLKKP 

PNV , 


12874 


26775 


A 


13003 


243 


293 


MNSQFLKRL\RIESPYGSVISLLGTYST 
EK*KACPCKTCTQMFTSALYIIAKKW\N 
KCPSTVRW/IKLWHINTMEFFCQ*KKFH 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!aIanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ■ =Sto p codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














CIDMPQFVHLTVEGHLSPLFCYYI 


12875 


26776 


A 


13004 


465 


307 


S EACF I LKLFMFCG * CCL FVC \ S ILLVF 
YFVFCFVLFCFAYYMQFFNQCLFG 


12876 


26777 


A 


13005 


207 


3 


DGSLTMLPKLEC/RWLFTGVIMAHCHLE 
/ RLGSND P PAS AS RVAGTTGVCHHA* L I 
FIFFVKMRSHYPTRP 


12877 


26778 


A 


13006 


2 


56 


DLALLFRL * CNGM I TAHCNLK/ RPG* LL 
GSRDPPTMVS*STGITGMSHEAHRPSPS 
CDF*RSCSWQAWQWHDYSSL 


12878 


26779 


A 


13007 


402 


74 


FGAS LLKRHFLFFLS KPLVYNKKI PRPL 
VFLKKT\PPPPGRATPFYYCCFFHKEGP 
GFGVKKEG* PKG/ S PPPPQRGS TPPFFL 
GAFKKPKVTTPFFKKNPPKTLKGPPFF 


12879 


26780 


A 


13008 


356 


145 


RPPPFFFF FE TVS H P VAHAGVQ WHDHS S 
LQL* TP \ S SREP PASAS * VAGTTGVHYL 
PGSQLP I LEHS PRR 


12880 


26781 


A 


13009 


227 


353 


GWSELT* / SN* AQWLTPVI IAL * EAEAG 
GSSEPRSLRSAWATW 


12881 


26782 


A 


13010 


249 


32 


NPDSHSWAWWCASWPGTQKARVGGSLE 
SRSLRLR*AMIVP\CTPAWVAQEPLSLK 
KPPTIETLFPSPQDPSQ 


12882 


26783 


A 


13011 


247 


403 


TPGGGGKYFFFWGPQKKNRGGGFKNGGG 
GKTRGP P * KQPAS S S PFAPP PFFWGAPG 
FSPPPRF\KPPPPVFFLGPPKKKIFPPP 
PGGLFFFFLGGPPSPFFFFFFFLGGGPL 
FFPGLGAMGPNAGFRSIjAPPGNSLSKKK 
KS EGLGEGGNS VLTRVLL I S S YQ I PGNP 
R 


12883 


26784 


A 


13012 


206 


386 


TREAQTFWFPLYS * KAL / WLGT VAHT * N 
PSTLGGQGRRITSSQESETSLANMVKPH 
LLLEI 


12884 


26785 


A 


13013 


354 


31 


YKMN I WD 1 1 H I * HT \ Y I PWP KGKERKR 
KKLEEYL/QDIMTKNFLNLMKNMNIYFQ 
EVQQTPSRINSQTSTPRHIIIKLSKYKT 
GREQWILSVMLALWEAEWGPPEVRS 


12885 


26786 


A 


13014 


1 


419 


EETSFGKARCLPGYPGFHPRRLRSRSPL 
LS*MPLSHPQGPSQPPAGSMSSSPPATT 
STCSCSPPAAGCTRGRTTGRPFWTASGP 
AELDWAS G S S V/ E ALT S GL P AR'GLW GQ Y 
AGLGVP PRLLGPMTS CCMSLP FPS PARP 


12886 


26787 


A 


13015 


109 


426 


TSLGNIARPH*QTHMATIRSVPPPSWKD 
ST*LRLHPPPPHMCF/PD*SLKSN*Y*H 
YLVNP \ I WPMLTLLFVCLFVCFRDRVSL 
CCPGWSAWPSRLTAASTLWVQVILPP\ 
RLAN* IFFLRWGLAMLPRLV*NSWAGDI 
L P P * P P KVLGLQ A 


12887 


26788 


A 


13016 


149 


422 


L KR I F FL KWFTHDTAP GDPDNMC P YS C 
F*NQI IKSGLFFFKKPHL/WLGEGAYPC 
NPTLLGGQGGRIT* FQEFETTPANMAKP 
LLYKKYKN 


12888 


26789 


A 


13018 


162 


377 


QSPGYPERVSKLPTITQRGMWMHQPSFV 
CTSL *NV* TGWAQ WLTPVI PAF * ED \ AL 
AGGLLEVKNLRPAWS T 


12889 


26790 


A 


13019 


205 


427 


IQTGGPPARPFLWPWAPGKK*RVPPEGG 
PP KKKKKTP P PHPGGEREGS / PFSKKKK 
KKTPKKRAGVFFPGGPRPKEGPGWGPPR 
LDSKPVGAPPEPGRLGNGPGKIFFGPGP 
FFPLQPQKGRP*PGGEGGSFFFFLGVPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cy stein e, D=Aspartic Acid, 
E=GIutamic Acid, F=OPhenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














LGEPSIFCLDLPENA 


12890 


26791 


A 


13020 


248 


358 


SRCGGEAQWLLP V/ 1 / PAF * EAEAGGS F 
EPKSLRPAWAT 


12891 


26792 


A 


13021 


2 


434 


ITEAGSDTQE I HHTL YTHL C VYWRF FT K 
KAQVKKKVWWHM* PQLLGRLEMGGSLEP 
RSLRLQGAMTLPVTSTT\QAWAT 


12892 


26793 


A 


13022 


162 


410 


LTQKTAASQI ILSFSSTSLEKIPISSES 
GGGGSVSVTQAAEQWRYHSLLQP*TPGD 
KRS SHLGI PHP \ S S WDYRI S KDWVMPV 


12893 


26794 


A 


13023 


250 


40 


L I KLNI FS CSLiAI WTS S CMTSIiFKYFAH 
FSFF* IDFYSSYYILDTNPFFRN/GITN 
ILSSSVTCSFILFW 


12894 


26795 


A 


13025 


264 


1 


VSSTKHGKLRLEPEY*CFIHH*EEKVFE 
SYFEY/PCQTFLRKETVRPGAVAYACNP 
STLGGQCRWI I * EQEFETSLTCIAAALR 
VTLR 


12895 


26796 


A 


13026 


110 


523 


CIDSSSWTQTDRCKEDRLCPGTTGTCAP 
GLLFLSAPPLGPF*PSQAHPSFHVSLSG 
PSPGLRPQLDI PLHPSTS IAPRPLPGPA 
S\LPRLEPHPSLPSPTGACPFLPPCLGE 
QRHWPLPTSLLWLGLDGHCLATPSVS 


12896 


26797 


A 


13027 


52 


53 


CPTLLQLTLC/LPPQASAMVDAPP*ARP 
LPPSSinACCTSGEQGSVWGLAEPVKG 
YNLLVCRLLRPLEKCGIWVAVSRFSLYS 
LSWLPLARKGXNPLTACASWVR 


12897 


26798 


A 


13028 


262 


381 


HFGRLRQVDRLS PGVRDSLCNKVKP * Q I 
IYTF* FSVSLSVINTYLKGPSTVAHTYN 
PSTSGG*GKWIA*AQEFE/TSLCNKVKP 
CLY 


12898 


26799 


A 


13030 


1 


846 


FRPIQAGRAPWALTLPCMGFTEGSPALIj 
SHSSRCDI QMG CNTHAVDD S TGE GP FHK 
GRMRE S CWLRTPRCGLYTKGVGGPGMEG 
RERRS RS RRQHMPGRGHS TMAS WSLRQ C 
MRLKLQWASDLLS QLEMQ IPSLP PGESG 
SLGLTGPRSLH/ ceqtiglsptpgesws 
EELG\GGIVGGGRGTVAGRGGHREAGNK 
PSGTWGRQCCASGDKAMS i *ltsly\ar 
RGE PG PGR P VRKWLRE S S QNR I GGGE GR 
WE*GQRRSREGRECPVGWGGRKKPLRWG 
LMWS 


12899 


26800 


A 


13032 


308 


1 


KHYHD F WCWI I LALSDAQS LDHTCV \ C V 
CVCVCVCVCVLRDQGSHSCL*CEG*LVP 
HFVPGPVLCLLSPVIYVCFICSLTCKVE 
SLiSNFFKKLNKALLRHCI 


12900 


26801 


A 


13033 


763 


244 


FLGPRI IGLRHEISVETQDHKSAVRGNN 
THDNYENVEAGPPKAKGKTDKELYENTG 
QSNFEEHIYGNETSSDYYNFQKPRPSEV 
PQDEDIYILPDSY^/CQNIDFCYWMIN 
I HCNFS TAKTRNQTKC* STVDW I KKMWY 
TYTI EYYAAVKK/ DTKLTWEQKI KYHI F 
SLKSGS 


12901 


26802 


A 


13034 


326 


58 


EHYKCNYCQKIFQKL* /HL*SKKHVYAE 
DKYYKYKDGWPLLVSQIILYTFYTRGK 
P * S S C S NF VQNQ 1 1 Y I GQKS C KCNKFGN 
TFFKKP 


12902 


26803 


A 


13035 


199 


3 


RFIYNHKVCITTFLPHFYF*KLYIi/WLG 
TVAHAWNTSTLGG*GGRIT*GQEIiETGL 
DNTARPHLHTV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12903 


26804 


A 


13036 


94 


448 


GSCATSMAFKDTRKTAVEPGWVHGIRI 
KPLE KVCADL I TGKKFKNL / KVKGP VCM 
PTKTLKITTRKTPCGESSKA\WDGFQMR 
IQK*LTNLHSPSKIVKQIISISGAPGID 
SEVTNADG 


12904 


26805 


A 


13038 


964 


1717 


HKLS G VKDLKDFMRQAGE \VTFA\DAHR 
PK\ LNEGWE FAS YGDL KNAI E KLS G k\ 
EINGKNINLIKGSK\RPSRSRSRSSDPG 
TQKS PLGPRSRS P S PWRKP * QPVQEAR 
SREPGARSQVPVPVK*GPPPCPEKEPQK 
\RGSLQSKILSPPSILWIRPEVSGPRSK 
VPRSVDSGN 


12905 


26806 


A 


13039 


2 


725 


SLLFSPVLPLFLFLPLAYILQNLKPLQF 
S PDLKS KHL I FFCNAA* PQYKLDRS SK* 
PENGTLNFS I LQDLNNS CRK I GKWS EVP 
EVQAFF\SHQSLPSLCAQCNS/CPNLPS 
FPP\PVPSVPTPSVTESF*SSFSTDPSD 
LSLPPQALLLARPS*VPILPQPPLLHPI 
ILLSPPLLTPRPAYSFIP*L/CPPPPAQ 
QFTLKKVAGAKGIVKVNAPFSLSQIR*H 
LGSFSSNIKIQPSSWLVWQQP 


12906 


26807 


A 


13040 


1131 


2 


DGVSLLSPRLECSGTISAHCNLHLLCLN 
DSPASAS*VAGITGPATTPPLIFVYFFF 
LVETGFHHVGQSGLELPTD\SDEPPASD 
SQSVRITGVSH/AYLACSQF/CTRC*FP 
1 1 * EQCVNMNINSS K* IFIL I S YLYRWM 
NKDPI/VIIERLLVRMPEGQSHAAW*NK 
TNKQ T * I LEL VL S VT S CGVS VRFNL SIP 

* VFP STK* EGWMRCVNP STLT * F I PQVA 

* TCQEHGLMKI QT *NPGLD P S I VSDHQT 
Q*EPSVKQVLDISHDKLPMSLEELYHCR 
HTLAS GKQI SGVPRTI FLNHFHGF YF I Y 
/CIFEMESCSV/SQAG/VQWRDLSLLQA 
RS/PGSR/RFSCLS/LPVQHPPPGPANF 
\CIFSGDGVSPC*PGWSQSPDLVIR 


12907 


26808 


A 


13041 


97 


626 


I CFTQVKVLNE S Q VQ WCM P WPAT WE AG 
AGGVLEPGS * EL * / CPVNSHCPPAWAT 


12908 


26809 


A 


13042 


2 


431 


HDPD THS Q I KV I KLMQS E F CWGWAW * LT 
PVI PALWEAE / AGGS PEVRS S 


12909 


26810 


A 


13043 


405 


1319 


RYRKKLQNKPLWLH/CLLLLPNSATR* T 
LQK* SK* I SPLESSA* * KEPPFLSVLS V 
TSVKT / S AI PMP PHWENVNTQ VP YQL I P 
LHNQTHE YNEVANLFGKTMDRNRI KRI Q 
RIQNLDLWEFFCRKKAQLKKKRGVPQIN 
E QML FHGT S S E F VEA I C IHNFD WR ING I 
HGAVFGKGTYFARDAAYSSRFCKDDIKH 
GNTFQ IHGVSLQQRHLFRTYKSMFLARV 
L I GD Y I NGDS K YMR P P S KDGS Y VNL YDS 
CVDDTWNPKIFWFDANQIYPEYLIDFH 
* FHFQ I S WKEAL FFFAGRFALQS SSH 


12910 


26811 


A 


13044 


23 


450 


RSRAAAIQLGQRLRLPGPDSPRLTGRQL 
E PGTGRGSRAKRGP CS I FRFRRETPPQR 
EGGEALGATSRQAPPPCPPRQPWELDAS 
AKAAAVAP PKKVGFKAVSRD PRAEALGG 
KGS P \ GGKGS L KGVAHR * GGLGPL * GAP 
EAI 


12911 


26812 


A - 


13045 


172 


16 


AHVSPEVDKTKFW\LGPVAHICNSSTLG 
GRGGQ IT/ R/GQE FETNLTNMEKPHLY 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=AIanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=GIutamic Acid, F=PhenylaIanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=GIycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12912 


26813 


A 


13046 


181 


9 


YKWGPAMLPRLKCRAI \IGSNMAHCRLD 
LPGSSDPPSSASQVAGTTGTCHLGARGG 
MV 


12913 


26814 


A 


13048 


2 


3507 


YVRVS LP P P P PAAGRPGAAVADDAREEE . 
EEAAPPPPPPPPPRLAAARPPGSQPRPP 
AAGEAQAAADMNHQQQQQQQKAGEQQLS 
EPEDMEMEAGDTDDPPRITQNPVINGNV 
ALSDGHNTAEEDMEDDTSWRSEATFQFT 
VERFSRLSESVLSPPCFVRNLPWKIMVM 
PRFYPDRPHQK\SVGFFLQ\CNAESDST 
SWS CHAQAVLKI INYRDDEKSFSRRI SH 
L F FHKENDWGF SNFMAWS E VTD P E KG F I 
DDDKVTFEVFVQADAPHGVAWDS KKHTG 
YVGLKNQGATCYMNSLLQTLFFTNQLRK 
AVYMMPTEGDDS SKS VPLALQRVFYELQ 
HSDKPVGTKKLTKSFGWETLDSFMQHDV 
QELCRVLLDNVENKMKGTCVEGTIPKLF 
RGKMVS YIQCKEVDYRSDRREDYYD IQL 
S I KGKKNI FESFVDYVAVEQLDGDNKYD 
AGEHGLQEAEKGVKFLTLPPVLHLQLMR 
FMYDPQTDQNIKINDRFEFPEQLPLDEF 
LQKTDP KDPANY I LHAVLVHSGDNHGGH 
YVVYLNPKGDGKWCKFDDDWSRCTKEE 
AI EHNYGGHDDDLS VRHCTNAYMLVYI R 
E S KL S E VLQAVTDHD I PQ Q L VE RLQE E K 
RIEAQKRKERQEAHLYMQVQ I VAEDQFC 
, GHQ GNDM YD E E KVK YT VFKVL KNS S LAE 
F VQ S L S QTMG F P QD Q 1 RL WPMQARS NGT 
KRPAMLDNEADGNKTMIELSDNENPWTI 
FLET VD P ELAASGATLP KFDKDHDVMLF 
LKM YD P KTRS LN YCGH IYTPISCKI RDL 
LP VMCDRAGF I QDTSL 1 L YE E VKPNLTE 
RIQDYDVSLDKALDELMDGDI I VFQKDD 
PENDNSELPTAKE YFRDLYHRVDVI FCD 
KTI PNDPGFWTLSNRMNYFQVAKTVAQ 
RLNTDPMLLQFFKSQGYRDGPGNPLRHN 
YEGTLRDLLQFFKPRQPKKLYYQQLKMK 
ITDFENRRSFKCIWLNSQFREEEITLYP 
DKHGCVRDLLEECKKAVELGEKASGKLR 
LLEIVSYKIIGVHQEDELLECLSPATSR 
TFRIEEI PLDQVD I DKENEMLVTVAHFH 
KEVFGTFGI PFLLRIHQGEHFREVMKRI 
QSLLDIQE ICE FE KF KFAI VMMGRHQ Y I N 
EDE YEVNLKDFE PQPGNMSHPRPWLGLD 
HFNKAPKRSRYTYLEKAIKIHN 


12914 


26815 


A 


13049 


30 


477 


VRAEHCAVWERNFEETVRWTSVKFLMTS 
PEIASLSWGQMKVKGSNTTYKDCKVWPG 
GSRTWDWRETGTEHSPGVQPAAC\DVKE 
WEKGVQTLVIGRGMSEALKVPS STVEY 
LKKHGI DMRVLQTEQAVKE YNALVAQGV 
RVGGVFHSTC 


12915 


26816 


A 


13050 


179 


389 


NIETIQSMFPI/DNEMKLEINLKKAIWG 
IHKYASQVLWFMPI FPTLWEAKVGEFLE 
PRSSRSVWETWRDPI 


12916 


26817 


A 


13052 


466 


1489 


PGKGEMRTRS PSPLAIVPRPQRASRPLL 
CAVS PMAS AS GAT AKHEQ I L VLD P P I DL 
KFKGP FTDWTTNLKLRNPSDRKVCFKV 
KTTVPHRYCVRPNSGI IDPGSTVTVSVM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 1 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQPFDYDPNEKSKHKFMV/QIFAPLNTS 
DMEAVWNEAKPHELMDSKWR/ CPNENDK 
LNDME PSKAVPLNAS KQDGPTPQPHSAS 
LHDTETRKLTEECKRLQGEMMKLSEENQ 
HLRDEGLRLRKVAYSDKPGSTSTAS FRD 
VTSLLPSLLWI AAI FTGFLFFFWRRSL 
TLVAQAGVQRRDLGS LQP P SPGFKQ FSH 
LSLPSSWDYRRPPPHPANFCIFSREGVS 
PCWPGWS 


12917 


26818 


A 


13053 


892 


189 


SYDPGALGCRPAPRSGRGKGSGATRDMH 
PMSRERAKFVKSGL\YCKTTRELMLHAR 
CCIiNQKGT I IiGIiDIiQNCSLEDPGPNFHQ 
AHTTVI IDLQANPLKGDIiANTFRGFTQL 
QTL ILPQHVNCPGG I NAWNT I TS Y I DNQ 
ICQGQKNLaSTNTGDPEMCPENGSCVPDG 
PGLLQCVCADGFH/G I QVYAPGLVLTAY 
VliRDSGSHHS IRLHSALGDPAPKSQDFM 
NYIGLTIDLRSI 


12918 


26819 


A 


13054 


423 


12 


SFNQKNPLRQKCSLRPTRPASQSLATWP 
GQS TRF WE Q AIiPWALWD P QS KRAE \ MQL 
E P PHAHTWTHAHTCGHTGA\ FCS AHTE V 
YVHLHS P VHAHTHAHTHS P P VHT CGCRIa 
HAHTH\ PQTPSPPGLLSLIiAVARPQS 


12919 


26820 


A 


13055 


16 


349 


RRSGKNDPWTDQS PRAAASHTRCPHPAV 
AAAAMP KRKTEGD AKGD KAKVKDE SQKT 
S / ARMS S KS AAS KAYAKP I KAPAMNGEN 
VPNG IQRKLEAF YDIiQACRHLTRIi IDFA 


12920 


26821 


C 


13056 


716 


594 


MMQTETGVMPPQARQCLEPPGAGTGRKD 
SPLYPSEQSWPL* 


12921 


26822 


A 


13057 


353 


2 


IGNPGKTTFNKPPPPKKFIFSKFFMFLF 
FLVKTR S C Y VAQAGLQTPG \ S S DPS AS A 
NQ SAG I TG VS HHTQS E I L K YT VF CLT F Q 
TPEYTIPSTWSIPCCSHFSCYLCFIiFAC 
FLRQSF 


12922 


26823 


A 


13058 


864 


529 


EVSTSKTAGCRGCALVISMLCI I FPQGL 
FTRALLRTCS ALLHADWPE I S FVWWF 
FKTE S CS VAQDGVQWCDLRSLKP PPP \ G 
S S D S PAS AS PVAK I TGMHHHARHGL KS 


12923 


26824 


A 


13059 


2714 


1376 


GR YD GL VE QLGGRAT PAVG FAMGLE RL V 
LLVQAVNPEFKADP WD I YLVASGADTQ 
SAAMALAERLRDELPGVKLMTNHGGGNF 
KKQFARADKWGARVAWLGE SE VANGTA 
WKDLRS ALAVG V I LGVGAL I GWRYWNS 
HQ VDS ARS AS LAYQNAVTAVS E GKPD S I 
PAAEKFAAENKNT YGALAS LELAQQFVD 
KNELEKAAAQLQQGLADTSDENLKAVIN 
LRLARVQVQLKQ ADAALKTLD / TP I KGB 
GWAAIVADLRGEALLSKGDKQGLLSVTL 
LSGCSLFNSEEDWKMSPLPTVENQFTP 
TTAWS TS DGSG I GNFYSNLHPALADNW 
YAADRAGLVKALNADDGKE I WSVSLTE K 
DGWFSKEPALLSCRVTSVWPNNQTEKFN 
KQRTEGPAVAE P E QAGVRRH YL Y P F TGG 
YRRRRAACQAERPAARCATEEHGSRN 


12924 


26825 


A 


13060 


99 


254 


RNHHLVAFKELSAL.KVKRVDWAPWLMPV 
IPAF/LEAKVGRSLEPRSSRPAWAT 


12925 


26826 


A 


13062 


127 


386 


FSYYYYFFFLFFXiCVFVIPGEPFLKMKL 
GKPKTXMLKEKKGGESYSELFFETDQME 
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peptide 
sequence 


M 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
(^Cysteine, D= As par tic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














WL/TPVIPTLWEAEVGGSFEHRSSR 


12926 


26827 


A 


13063 


1 


209 


GKY I LLLKVFC CFLRQGLCHLAHCS LRL 
PGSSDSRASASQVAG/ISITGVHHHARL 
NFVFCFFFLVCFCF 


12927 


26828 


A 


13064 


132 


1 


SHAM CCNYLKRFG WAR WLMPV/ IPALWE 
AEAGRSPEVRSSRSAL 


12928 


26829 


A 


13065 


269 


134 


TLWSVFASS/WAWWLTPVILAPWEAKAG 
GS PE VRSLRI CKQKVLS 


12929 


26830 


A 


13066 


853 


461 


RLWRSVSQFLCACVHMCVPSVCVCGI.AH 
AGTYAVSTCVH/MCVCVCMCARITLAKI 
CS ETSGNLLFRKI I SLLDLFTLNFS I KY 
KVQLVLKSRSLAWWGRI KLLTLSRP FSD 
VNTFERINYWLIND ISIIR 


12930 


26831 


A 


13067 


57 


2066 


AQ P TGRQT S T VAP TP ATAR S RGGRVP I F 
PCPHPAPTLLSTSRWTPAS PAASMKAL 
RLSASALFCLLLINGLGAAPPGRPEAQP 
P PLS S EHKE P VAGDAVPGP KDGS AP E VR 
GARNSEPQDEGELFQGVDPRALAAVLLQ 
ALDR P AS P PAP SGSQQGPEE E AAE AL LT 
ETVRSQTHSLPAAGEP\EPAAPPRPQTP 
ENGPEASDPSEELEALASLLQELRDFS P 
SSAKRQQETAAAETETRTHTLTRVNLES 
P GP E R VWRAS WGE FQARVP E RAP L P P PA 
PSQFQARMPDSGPLPETHKFGEGVSSPK 
THLGEALAPLSKAYQGVAAPFPKARRPE 
SALLGGSEAGERLLQQGLAQVEAGRRQA 
EATRQAAAQEERIxADLASDLLLQYLLQG 
GARQRGLGGRGLQEAAEERESAREEEEA 
EQERRGGEERVGEEDE EAAEAAEAEADE 
AERARQNAL L FAE E E DGE AGAED KRS QE 
ETPGHRRKEAEGTEEGGEEEDDEEMDPQ 
TIDSLIELSTKLHLPADDWSIIEEVEE 
KRNRKKKAP PE P VPPPRAAPAPTHVRS P 
QPPPPAPAPARDELPDWNEVLPPWDREE 
DE V Y P P G P YHP F PNY I RPRTLQ P P S ALR 
RRHYHHALPPSRHYPGREAQARHAQQEE 
AEAEERRLQEQEELENYIEHVLLRRP 


12931 


26832 


A 


13068 


282 


53 


PGFPQGKFRFKKKILPQDYPFQGAPNSK 
ARPGQGVPPGIPA\LWRVKKGGPLRSGG 
PGPPGAKGGTPFLPIPNTKKS 


12932 


26833 


A 


13069 


177 


2 


VSLCRPGWMECSGVISAHCNLRLPGSSN 
S / PAIiASRVAGI TGARHHALLNSLFKVL 
SRA 


12933 


26834 


A 


13070 


789 


302 


PHPEFYYSFQYFIYFFISFHFTRHFIVH 
YFISSHFISCHLASSSSSSSSSSSSSSS 
SSPPAFHYSFHHFISFHLiAIHY\HFI IS 
FHLFHLFSSFHSISFRDRDLLYCPGWSV 
P PHPAFH YS FHH / VHVMS RH F I LRQ TR S 
I AQ AG I QWRDLCS LQPE VGS S RS A 


12934 


26835 


A 


13071 


193 


2 


PPYLKSHSQVDLSPGVQDQPGQHRETPS 
/LTKM/TKVSWVWWCTPVIPARWEVEVR 
ESLEPRCSTRA 


12935 


26836 


A 


13072 


122 


1495 


LLSDFFFFETE / SRS I AQACMQWC YLS S 
LQSLPPGFKRFSCLSLPSSWDDRCPPPC 
LANYC I FS RDRVL PCW PGWS RTPDLR 


12936 


26837 


A 


13073 


178 


347 


CCLCNDNSVLLCMYYLLVCMHACMHALY 
IKTWK/MQLGAVSHACNPSTLGGQGEWI 
T 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


12937 


26838 


A 


13074 


165 


34 


FILSFPFE T VS YF VAQAGVQ W \ L I 1 AHC 
SLDLPRSSDSPTSAP 


12938 


26839 


A 


13075 


205 


105 


GGWVQWFMS IIP / AI WEAKAGGLLEPRN 
SRPVWAT 


12939 


26840 


A 


13076 


3 


357 


HERE YTQAEKRKF I AFFFFKMGS CS VI Q 
ARVQRHDLSSLQPP/RVAGTTGTHYHTW 
LI FVFS RTDSRLPS S PRS PLTS VI SL 1 1 
PLSLIPTTYLTTPSRPSSTSPPSHPSFR 
APLTSDP 


12940 


26841 


A 


13077 


3 


477 


STPALTARGLSGGSRASVAAMVLLESEQ 
FLTELTRLFQKCRTSGSVYITLKKYDGR 
TKPIPKKGTVEGFEPADNKCLLRATDGK 
KKI S TWS S KE VNKFQMAYSNL LRANMD 
GLKKRDKKNKTKKTKAGSNSS /ATTAAP 
AAAATAATTAATTAATAAQ 


12941 


26842 


A 


13078 


1077 


1566 


RQVTSLDPLKTFPRTSLEFTSPTGQLDS 
GATAFTWWP S GAP T P RGRE S AAHQGN / 
YPVGQRS\RMAGLSPRSFCWWEVSFSR/ 
SHSC/I CWVQLGKS AVLPQQSRGPGAHE 
GSRRRGGSSPPCPPLQTQLVFSPQELSM 
SAPIHRLSVTLQGDCISTGGAPSRGRIiA 


12942 


26843 


A 


13079 


3 


396 


HERGRKMVS VT \RL I QRLRNWASGHDLQ 
GKLQLRYQE I S KRTQ PTTKLPEGPSHKL 
FNNYY\CTRDGRRESVPPFIIMSSHKAL 
VSRMPAYS FAEAATDKKAVTPAP S I YRG 
EQSSDHPYLRHCTLTVTRLL 


12943 


26844 


A 


13080 


467 


109 


SLVPWSPHLKEPSCRYLIENVMLVPKE 
QEFWSKEAGSLFSPPVPFYCWTPSIVFS 
SSFFFLFRCVCFFFFFFCFFFFFF\LLF 
FGTKKAL YLLRARG CKQ YKNQ KL I WYL T 
FLSLLVK 


12944 


26845 


A 


13081 


458 


51 


VSLFCTVAPLLLLPEGIFFSCSLLEVKA 
PMRDVSGVFLTLFLYSVLFFPLGVFFFL 
FFFLIIiPPPKIFFFFFVCVALFFFSPAF 
GAVFLF\SLWSSVLPGVGFFTACSFFFF 
FFFFFFFFLRQSFALVAQARMQWR 


12945 


26846 


A 


13082 


23 


327 


RSD S EGGRRGHKY I F I P I YTHTHT YI F I 
YTQIYI CLHIYACLS /LICTHI FI S I YI 
HTYIYISIYAHIYLYIHTHIYLHIYTHT 
YIYIYLHIHIYSSWVCKGS 


12946 


26847 


A 


13083 


352 


153 


KKLVEYYSAIKKNSVLIH/VSTWMRLKN 
ITLSLKTQSQRHLYYIIPSVRNVQNRQV 
HKYRSRIiVDSED 


12947 


26848 


A 


13084 


121 


417 


DYDYYCYYYPKIYLTKLLYFY\VGNYIY 
THI YVRHVYI CET YMP VYTH I HT Y I CI Y 
IYIWKETTWLFSCGISKTYRTHDIAYTY 
IiVF CLTHVT FHVS S T 


12948 


26849 


A 


13085 


242 


12 


LSSRLPMLLETKEITSKILPSRKL/LPY 
G WC F KS F K I T W VS Q I RNRL \ AQ WL T T VI 
PQLWEA\SGLLKPRSRRPAWTTW 


12949 


26850 


A 


13086 


274 


340 


GGEKKTPGGFLEKKS FFGGGI FGPPP/ P 
TKRGVF P P S PKKF F F P P KTKNFGRGGG P 
KIPPPKKDFFSKNPPGVFFSPP/YKKKK 
I IFPPPVKLGPPKDFLKSPPPFFFFFYF 
FFFFFFSSQEVNLKALiSSTSEIIFFYIiV 


12950 


26851 


A 


13087 


241 


405 


PPPSKIHI GNFRKMS D VKNATGWAW WLT 
P VI P ARWE AE AG \ VS LRPRNSRPAWAA 


12951 


26852 


A 


13088 


537 


375 


KHSLTLSLRLECRGTILAHCS /L/CTSL 
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Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H— Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \-possibIe 
nucleotide insertion 














LGS S DL S LxQPQQ I AGTTGACHHTHL E K I 
F 


12952 


26853 


A 


13089 


448 


186 


GALVFPSSQAPPCPPKMEQSFFFFFFFL 
ETGSRSVSKAGVQLHHGSLQPPPPGFER 
SSHFSVTTI/TYLYHHRLALHVFELHIN 
GIMHR 


12953 


26854 


A 


13090 


365 


191 


GQEREKEREREKRRQKKREVE / REEEKE 
RVRDLESESNRALERERSALVDRAPLSR 
PG 


12954 


26855 


A 


13091 


148 


316 


DKME S CS LAQTG VQWHD \ LG \ YLQP S P L 
GFKRFCLSLSSSWDYRWETSTEPFQQAV 
F 


12955 


26856 


A 


13092 


712 


229 


FVAWVKMAEYLAS I FGTEKDKVNCSFYF 
KIGVCRHGDRCSRLHNKPTFSQEVFTEL 
QEKYGE IEEMNVCDNLGDHLVGNVYVKF 
RREEDGERAVAELSNRWFNGQAVH/GEC 
TRGGFCNFMHLRPISQNLQRQLYGRGPR 
RRSPPRFHTGHHPRERNHRVFP 


12956 


26857 


A 


13093 


835 


405 


EL IERLGPNQKPP I LMWKPMPS PSNMKA 
SAALLCLLLTAAAFSPQGLAQPVGINTS 
TTCCYRFINKKI PKQRLES YRRTTSSHC 
PRE AVI FKTKLDK\ E I CADPTQKWVQDF 
MKHLDKKTQ \ TP KliLNIHDWNLENQAMT 
LRKPN 


12957 


26858 


A 


13094 


1137 


424 


LCP SHFAPTTLTQ PGAHKNMCC I KSRFK 
RDLGLCRTCLVNKMFTSS ILGKSHRHSL 
VS I NQGNNALWKAAG \ PL S W KAG YC \ QG 
FSPCDSLKYG\SWDEKDLTVPQPDTHKG 
S VLRW I S KRG KP LAVE I E GRATGLPGLA 
P WGTE \ CLG YKT P I V\ HL FNS E MG\ ENR 
PYGGEARHVCSNAALLFFTPLRCLGGEK 
HKSGLRAHPVIVLSLELNYDIDSFAHMF 
FADLLLIITLLSCYIPFC 


12958 


26859 


A 


13095 


1 


324 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERE R P / RPRVL F F VRGRE GS HP E KCEN 
KKERAPFSRALRKISIETRGGVSITTEA 
CVFVIGAHRVCRSVYIYI 


12959 


26860 


A 


13096 


508 


840 


DGVSLCRPGRTADCSGAISAHCKLRFPG 
SRQSPGLSLPSSWDYRRLKPMRPANFFF 
C I FF \ VKTGFHL VS Q / AMGLDLLNS /SI 
PPRLGLPKCW\DYRREATAPGQELLLWR 
TGM 


12960 


26861 


A 


13097 


53 


254 


WPQTASCLVAQAGVQWSDHSSLQRQTPG 
\ S SHP P AS ASQ VAR I TGMHHCAWLHL 1 1 
LLRAHGS S PLC 


12961 


26862 


A 


13098 


329 


67 


GPGAPLLKGEKLLPKEPPPRISPIVSAF 
FAWGPPPGYCFLKKGGPFF\VFPPSFFG 
KKKNSPPPPHINGGAKTFPNPPLLFSSF 
FFF 


12962 


26863 


A 


13099 


253 


444 


EGAFSLGT FYF I FFLVLSNTFLLS I KNQ 
QI WNKKT W\ PS PFLPTLIALTTLLLP I 
SPFILIIL 


12963 


26864 


A 


13100 


341 


590 


PMAS S Q I C I GRI P P FL YLKW YF WPGKW 
HTCNP KTLGGRGAL I TQGQKFENS LAN\ 
MAKPHLY 


12964 


26865 


A 


13101 


55 


184 


RLRE PTPLYSE / HAP T KY YRMAHHN YPH 
S PPI PQPRHHYRVFLL 



1331 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 


12965 


26866 


A 


13102 


433 


762 


RNLLNCKRGLERKGCRASWPAVPLFLQR 
RWPRPTAELNEACMLAQHQNCQNVKME 
YFQLS FTKRQE I /WLGWAHACNPSTLG 
G \QGGG I ACSQE FKT I PS S KVRP CF YLK 


12966 


26867 


A 


13103 


349 


250 


DLFFSTKPPPPPPPPKNLGPP\GPFGPP 
FQEG 


12967 


26868 


A 


13104 


3 


236 


YPQHYPSCPTFQYTLKTKNHNFLIFFEV 
GSCSWQARVQWCHHGSLQPP/NSSDPP 
ASAFQVAGTPGVSHRAPPKNQNF 


12968 


26869 


A 


13106 


154 


4 


KKVKNTVHYNGKYFLKSQNQSTFLKEIIi 
/ WLGGVAHTCNP STLGGRS GRIT 


12969 


26870 


A 


13107 


391 


150 


RPPKKPNKKYSQITSKGIKMVYTRKYLT 
Q KKKAMS QALW Y / V / HAVPATQE AE VGG 
S CE PGS S RPRCTM I MPMNTHCAPAWAT 


12970 


26871 


A 


13108 


90 


252 


K VGDGGR CL VF C FY F KRQGF T VAQAG VQ 
W\ATIAHCSLKLLGSSDPQNYEDSWGT 


12971 


26872 


A 


13109 


197 


414 


LYEFIKITVLLGLGCPLGRYGCSNQRFF 
S LEME FHTVAWARVQWHDLDS LQPLP PR 
PNRFSCLR/YPQSSWDYR 


12972 


26873 


A 


13110 


2 


317 


GRVGANVHKGHRQRTYGSVI PHI LPLHV 
L KKT F S LRDFHF S VS L KK\NL VLT CLHL 
FL/GVRTPRNDPFVSMMLLFTAFLDRPS 
T I LGTGLLYTEGLT VALRIAYL.R 


12973 


26874 


A 


13111 


413 


162 


LILLPQPAECLQLQASAATLDCLGLPRC 
RDCRRS LVHS VLNGAQAGVQ WRDLGS LQ 
PPPPS\TCLGLPKYRDCSLCPAATPSGK 


12974 


26875 


A 


13112 


40 


296 


ESRLSALYIYHICVCVYNREHLLYGIHM 
/SIHTYTHTHTHTHIYIHMRAFALEDKF 
I CSLLCSRQDNAF I LVSVKLQRKNYFLT 
RR 


12975 


26876 


A 


13113 


2 


288 


FEPRCKNSARGKVPRGFSPNLPHVTVET 
PEGSKTGFEDESR/RRHRQIHSSAQCPL 
TAVPGAGDAI PEDASGWHTWLPLHAQNC 
FLLYIQAPEQPPA 


12976 


26877 


A 


13114 


685 


340 


LS PPRAGS ARPTGP PTDAPGQRS TWTCG 
ALKPRRRAIiRDAAENLFQELQEHFQALT 
ATLNLRMEEMGNRIEDLQK\NVNDL\MV 
QAGIENS I KEQMLKTVTANMS'VFGDGAS 
YRSH 


12977 


26878 


A 


13115 


65 


3021 


GVLTQMGDEKDSWKVKTLDEILQEKKRR 
KEQEEKAEIKRLKNSDDRDSKRDSLEEG 
ELRDHCME I TIRNS P YRRED S ME DRGE E 
DDSLAIKPPQQMSWKEKVHHRKDEKRKE 
KCRHHSHSAEGGKHARVKEREHERRKRH 
REEQDKARREWERQKRREMAREHSRRER 
DRLEQLERKRERERKMREQQKEQRSQKE 
RERRAEERRKEREARREVSAHHRTMRED 
YSDKVKASHWSRS P PRP PRERFELGDGR 
KPVKEEKMEERDLLSDLQDISDSERKTS 
SAESSSAESGSGSEEEEEEEEEEEEEGS 
TSEESEEEEEEEEEEEEETGSNSEEASE 
QSAEEVSEEEMSEDEERENENHLLVGKN 
LPGDRVPVPESRFDRDSGESEEAEEEVG 
EGTPQ \ S SALTEGD YVPDSLPLS PI ELK 
QELPKYLPALQGCRSVDEFQCLNRIEEG 
TYGWYRAKDKKTDE I VALKRLKMEKEK 
EGFPITSLREINTILKAQHPNIVTVREI 
WGS NMD K I Y I VMN YVEHDL KS LME TMK 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
TJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, I>= As parti c Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Ij=Eeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QPFLPGEVKTLMIQIjLRGVKULHDNWIL 
HRDLKTSNLLLSHAGILKVGDFGLAREY 
GSPLKAYTPWVTQWYRAPELLLGAKEY 
STAVDMWSVGCIFGELLTQKPLFPGNSE 
IDQINKVFKELGTPSEKIWPGYSELPW 
KKMTFSEHPYNNLRKRFGALLSDQGFDL 
MNKFLTYFPGRRI SAEDGLKHEYFRETP 
LPIDPSMFPTWPAKSEQQRV\KRGTSPR 
PP \ EGGLGYS QLGDDDLKETG \ FHLTTT 
NPGGLLPRGPGFS LRVAE VS ELDP V I GR 
WSAGDHRRGYL PAWNCDGTRNS S S YFVL 
PCFVFVFLVCKLSRIKSFSLLWRKELCF 
LRDLPRAERVGI FRGAHVGQHKPHHHPL 
PLS TRTGIjGWAVI WKGTGGS RVDGL I FG 
AGDLFLCWDERCPLAPTHSSRPALSTTS 
PRPPTTQPQLIiPAGLEQVFYYKWGVLK 
CIKIFFEE 


12978 


26879 


A 


13116 


469 


202 


MIIIVGLCEYTKSAVLVQHSDPLPPAPG 
VFFLAVLDWDIKIWHPVFCLFVCLYE 
MES CP VAQAGVQ WRDLGS LQ P P P \ PGS S 
TSCF 


12979 


26880 


A 


13117 


193 


383 


AARMASTFFVFWFWFSTF\SLYSICVCV 
CVCVCVCVCRMHTVQLYFFQVGCQCSSL, 
LNPLFPMN 


12980 


26881 


A 


13118 


146 


350 


HYLVKFLLLAHPGSKAPPLNTIiQPPPLA 
TREQPPLTVIFHYPPTSYKMAP\PYLPS 
LTLFGLSPPAPR 


12981 


26882 


A 


13119 


124 


254 


HALLQTIiRDLSQAWWHALVI PATW\EAE 
AGGSLEPRSSRPAWAT 


12982 


26883 


A 


13120 


150 


45 


SLPRLECSFTVLAH/C/NLHLLGSSDSP 
ASASQWGGI 


12983 


26884 


A 


13121 


336 


446 


IPLKI SS WAQWLTPV/ IMPAVWEAEAGG 
SLEVRSSGPA 


12984 


26885 


A 


13122 


181 


39 


QNLTAQDGVQWCDLGSLQPP\LPSSWDY 
RRESLCPSSFFIFWKRWGFT 


12985 


26886 


A 


13123 


463 


2 


GPAVPSGLYLKGKPMRRLCASHSPESHS 
HDKGQGTLPRIPQ/PLLGEGGSR/PLAW 
GAVAGLPSNRPRIVPLPAPTRSGTRVRP 
HTGHQPGMSPGCGCESKWPHSAAAWPEA 
YPHFFLLPFPTQGCEVLGPLYTADPWVI 
CVSLLPSCPNSTAVDVD 


12986 


26887 


A 


13124 


429 


263 


DHF S FL C VTG S Q S VAQAGVQ W \ L 1 1 AH C 
SLKLLASSDPPASVFQSTRITGPFLSFT 


12987 


26888 


A 


13125 


310 


486 


NKKTTRGE S S KQRQ P R VSHH P GWGT VAP 
SGFTAASAS\RARVILLPRPPE 


12988 


26889 


A 


13126 


124 


461 


GSPLQLLPSPLAAIiTRDCSEAPMGSCSV 
AQDGVQWRDLGSLQRLPPGFEPFSCLSP 
TPAP\FPSGWDYR 


12989 


26890 


A 


13127 


2 


337 


RGAAPAAMAVTAIiAARTWLGVWGVRTMQ 
ARGFGSDQSENVDRGAGS IREAGGAFGK 
REQAE E E R YFRAQ S RE QLAAL KK\HHDE 
E I VHHKKE I ERLQKE IERHKQKI KMLKP 


12990 


26891 


A 


13128 


681 


422 


CRSDRWAKEHRGKRGQDSSKDVMARLME 
APKQTAQYFFI FYFFETKS YSVTQAGVQ 
WIiDLGSLQRPPPG\SSDSPASASCAWPQ 
T AH 


12991 


26892 


A 


13129 


299 


579 


L VMF C KVTK I E RLNL WRPGT VAHVLWS Q 
HFGRPRWAVHVGWGVRDP / RLTQHGE T P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L?=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possibie 
nucleotide deletion, Y=possible 
nucleotide insertion 














SLLE \NRNI SQAWWCMP VVPATREAEAR 
ESLEPGGQRLQ 


12992 


26893 


A 


13130 


268 


438 


KVDKSMKMRKNRP KNAENS KMKKAS S P P 
NDCNSSPARVQ/SWTENEFDKLTEVGFR 
KW 


12993 


26894 


A 


13132 


1108 


1529 


KIHSS FVLLNAKLVAVAAKTPKFYKKKT 
DLPQTYHSAQTAVPEIPVSAHSTFIRSF 
IHSFIFETESYSVAQAGIQWHDLPS\PP 
GS \ SDS P \ AS VS RVAGT IGVCQQ AWL F F 
V\FLVET\GSFPHLSPLTMIRNKKQLFK 
KGH 


12994 


26895 


A 


13133 


497 


370 


RPSTVSPRLECSGLILAHCS \LDPPASA 
SQTAETTYGHHHAS 


12995 


26896 


A 


13134 


217 


373 


CNHTE I TYVTSXKKKHRPGTVAH / S L YN P S 
TLGGQGGLITQAQEFDTSLGNMTLS 


12996 


26897 


A 


13135 


257 


507 


PVSPGFPPL\CPVPHTPHLCPPCRYPES 
PGQPAAQHCGAPRQPQPNPRACSSRGLF 
RCPPACHHAPSCLLCVCPPLPQCCRVQG 


12997 


26898 


A 


13136 


137 


309 


PSFPTHPQFCFLFCFLFVFETE /SSIVA 
QAEVQWCDLGSPDQGSSASPPSASSLPL 
SIW 


12998 


26899 


A 


13137 


297 


419 


GGSAVYYIWEIVQVLWLMPVIPT\WEAY 
AGGLFEPRSSRPA 


12999 


26900 


A 


13138 


213 


492 


QVGINYQPPTWPVGDLAKVQRAVCMLS 
NTTAI TEAWGLPGAI SSAKCALVHW\ YV 
GEGMAVGEFSEAREDLAALEDYETCGPW 
NPVETEAERR 


13000 


26901 


A 


13139 


246 


534 


DRVLPLLPRLECSSAILGHCP/APASQV 
TGTTGTHHHTQL I FLLYCWLCP P PLAS E 
AS AE S P PLLPRGFLCFSSTTS SITAIPA 
TTRDYVDVSRSGS 


13001 


26902 


A 


13140 


1 


206 


SLEWPFFFSFSETGSHSFTQAIVQWCNH 
SSL\SPTSASQIAGTTGVHNHAQL/VFV 
FCFAFMSPQL.SNSC 


13002 


26903 


A 


13141 


164 


582 


GL VLVNLYLRS YLM VF I YINS RLLQVTD 
ASRERRENESSWFCRSVCRM/PPVGPGP 
S CVHP CS PRGS ISP CTPHCPLKTTQAPA 
\ PPRPIiPRVGPGPHLDQSCCVAARPLYD 
P I LVAAHS S WPAP P S P VP VPQEDTEHGA 
G 


13003 


26904 


A 


13142 


235 


367 


LALNTFCWLGWAHACNPSTLGGQGEWV 
TRGQE \ QANMAKTCL Y 


13004 


26905 


A 


13143 


91 


403 


GALQ PATAP WE P L S GLAE AGAI S P CLQG 
G/ SGGRGAS ENRGCTR PGR VP GGRRLNR 
PRAWSSWPAPPAPGSERPSAINCPRAEE 
CGRRVWDWQAALPAAPAWVPV 


13005 


26906 


A 


13144 


558 


147 


GHSFQNWLLGCPLQLQRHLLHPPDASEE 
CGSPPWTPLRPHIHPSAGPHGSTHK\HA 
GGCLFPLRPSPTHPCGPPVPLPWSQ\QA 
PLCPHPPIHSPHDWECDPGKSALIPPPA 
AQS PSRL I KHAANE PE CS PRNRRPG I P 


13006 


26907 


A 


13145 


321 


421 


GCAQWLTPV/ IPALWEAKAGESLLFRSL 
RPAWAI 


13007 


26908 


A 


13146 


3 


398 


IELLQSHDKTLTDKKIiLLLDEQRKWFLE 

mestpgedvnnvelttkdleyhinlddX 
kagfer i dsnfervs avg / kmlsns i ac 
yreifhersvkvanftvailpqphklsv 
tinliiqqpstlrqdlllakr 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


13008 


26909 


A 


13147 


228 


261 


S KTQRS S E YY SLS VCVCVCVRVC IYIYI 
FFFFETEFLS\APQAGVQRCDLSSLRLP 
PP\GSSDSPFSTSQSAWDYRREPQRPTS 
LHYFVQLKHNLHRVKRRDLRNTTR 


13009 


26910 


A 


13148 


3 


275 


KHLSDLQ ILRLGLS S KTRKTRYSHFAQG 
RSIFFLLRPSVDWIKPACMDGNLLFSKF 
TNLNVNI \TKKHPHRNVQNNVWPNVWAP 
WPKEVDT 


13010 


26911 


A 


13149 


218 


382 


AWDH I KLE S F YTAKETT / RMKRQ PTE WE 
KIFANYPSDEGLVTRMYKELKQLYRKKI 


13011 


26912 


A 


13150 


391 


29 


FCFLFLFMFLFFFEMQDGVQWCNLSSPQ 
P P PPG \ S SHS PAS AS Q WEAENCLNPGG 
GGCSLGDRARLPQRKRERERETERKRKR 
E RAS CAF CGAI T F VNMW CQ KHHT CNHL P 
WLTVFETNKL 


13012 


26913 


A 


13151 


351 


157 


FG YSR FL C VYI NFRTNL S I YN Y I C I Y IN 
LYL\NIHTYTLIYTYVHTYTHIHIFYIY 
IGILTGIT 


13013 


26914 


A 


13152 


402 


111 


TFFGGEKPPFTLRPGPPRREGGGGPPGP 
REGGFFRVPLGAFGPSRPK/RPEKSGPV 
WAPQIFLWGGKSRAIKESGKGFFFSRIT 
LLVFATEKKKKTTLF 


13014 


26915 


A 


13153 


310 


141 


RE HE KQKP L S Q VRWCT P L VQAAQ E AEAG 
GELEPRSSRLQCSMT\TPVNSHRPPARV 
T 


13015 


26916 


A 


13154 


153 


21 


LNKCVISWLGAVAHACN/ PSTLEGRWIT 
QGQEFKTSLGSLAKPHLY 


13016 


26917 


A 


13155 


429 


275 


S VD I P L E KTL INKD VF KE AAF KG KVS / R 
QARVKFKKRYKPVKTKWFFQKLWF 


13017 


26918 


A 


13156 


303 


42 


YRERAPVYP I I IN I AFGPHKQ \ TRAHTH 
THTHTRMHAHTHTHPAI I PAFAKNGLGT 
FHHKGRT YNDKVKPRLS CEAAI P S WLKS 
REFR 


13018 


26919 


A 


13157 


198 


13 


SHGVAILLFFHFLNKRAFT\YSPAMNSS 
LCEIQEPSLG\SGSGPLSGNRMSGKGHR 
KYFAKI 


13019 


26920 


A 


13158 


192 


2 


ISNSHRKDIKDPEVLLIEQKYILSGQMQ 
WLTPVIPAL/V/EAEAGRSLEFRSLRPA 
WATDRDSVSK 


13020 


26921 


A 


13159 


40 


190 


RVDPRVRE S RS VAQAGMQWRDLGS LQAP 
P \ PGSRHS PAS AS Q VALFLNRK 


13021 


26922 


A 


13160 


1 


178 


PTMVLS PAD KTNVKA/ AWGKVGAHAGE Y 
GAEALERMFLS FPTTKT YFPHFDLSHGV 
SSYL 


13022 


26923 


A 


13161 


411 


41 


ESLQICVGLHAKPSWDACRPWLGQAWFR 
YFLLTFLGPLVFLLVFWFSKIYFLYHLK 
NP S LS S C YNLD \ FNTQYLKYCKKKKRLG 
AVAYACNPS ILGSQGGRITRGQEFETRL 
GNMAKP DAWVD P 


13023 


26924 


A 


13162 


372 


218 


FWKRSWISKKGLFKRGVSLSLSLS/CDT 
HTHTHTHTHTHAHSGYLCVLRKVK 


13024 


26925 


A 


13163 


247 


89 


QGLVLSPWLKCRGGITAHCGLLGSSHPT 
TSA\QVAGTTGTQHHAQLKFFILI IL 


13025 


26926 


A 


13164 


122 


409 


FPSTAIANSH/CSSPQGCSAYGVAIRLF 
LRSSNKLAL\LYGLAL/NSFFCNFQEPS 
\LGSLSGPLSDNKSNNLCCFKSPSSWLF 
AIAVLGNYYSLNTVRSGRRDPGTPNACS 


13026 


26927 


A 


13165 


69 


313 


PSVSLFFPPNHNIVIHMGLLKINSIFGE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine s 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *-Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














GIALSSRMECSVTITVHCGLDLLGSGGP 
PSAAS /RVAVTAGTCHHSWLMFLSFV 


13027 


26928 


A 


13166 


310 


26 


IKSMTPLPALNVSLSYYLPPLRPCRERG 
MVQDRFFCFLFCCCCFIFSETGSHSAAQ 
AVSQNKKTTGWAWWLTPVI PATHEAEAG 
\ E S LERGRQRLQ 


13028 


26929 


A 


13167 


296 


68 


KNGPLNLATKSLL I LTMKCLGVVLGGKP 
NCSKLKKNKIKTKKWGWAPWLTPILPA/ 
LLEAEAGRSLEARNLRPVWT 


13029 


26930 


A 


13168 


2 


235 


GATPCPREPSLFPLAGWAPLSCWAPPPH 
CTQGTTP PGLMGTG\ AP I AI I GGNARS P 
PMPFLPLSGTRRALSAQECPTF 


13030 


26931 


A 


13169 


248 


406 


TERCGFDLLSLGQVQWLTS VI PALE / EA 
EAGELLESRS WRPAWATWQNL I SLKK 


13031 


26932 


A 


13170 


33 


308 


NS LQGAEMAAAQS S LFS AS EGYRGERGR 
GRGRGR PGRG / R S E GDRGGRGR PGRGRG 
RKVLLLRI C VWRR VLRL WNRGR I G VGGR 
VTYLMESEI 


13032 


26933 


A 


13171 


294 


56 


ICKRNYLFFIYFLREGLTLLPRLECSGA 
I TAQ CSIj\GSSDPP TS AS T VS GTTGACQ 
CALSSRDALISLCYPGWSQAPGLK 


13033 


26934 


A 


13172 


71 


476 


DSLNYDNHIYQMPLIYINMGLAVTISLL 
G\ILVYRSHLISSLLCLEGIILSLFIIA 
TLITLNTHSLLSNIVPIAILVFAACEAA 
VGWEECGFRSRPTALILEGPLRSCQPRT 
CVRRPACTRRGPAFVGKNRHS IGG 


13034 


26935 


A 


13173 


86 


318 


NNCNVC VCVCVC VC VC VC IH I C I YV/ YV 
YI C I YTYKYI C I YTYKY I C I C VYIHVYM 
YMCVYVY I C I CMC VYI C IDSKGR 


13035 


26936 


A 


13174 


466 


726 


MCMKFGAVTRIG\DLPWE INPLSSCSLL 
REKDP PTTSGPQTDQPKKHLTNFKSGKR 
PLFTLFSNIjPHYPSTSFSFQSWRHTSIS 
PFS 


13036 


26937 


A 


13175 


128 


325 


FLRVI L I CHKM YGC VCVCM / YACI CNGM 
CIYMCVCVKSWKLKPELKCLCISDEVGW 
VLTQWAGREEN 


13037 


26938 


A 


13176 


134 


311 


INSNSEKMGSHYVAQDGVE/WQGIiFTGH 
GIAHCNLKLLGSSNFSAS ISQGAGTTGI 
VANG 


13038 


26939 


A 


13177 


577 


1052 


S PTSTRTGSGVAMWLSQPMGRT \ CRSKV 
ASKSRLHGPLIWAHPPIVRCPMLRHHYX 
KAQAGRGL S LE E \ LRVAG I YKKVAQT I G 
I S \ E DARRRNQ S T \ Q ALQAKVQRLK\ ED 
RSSL I LFPRK\ PLAPKKGDS S AEELELD 
TQLTGPEMPIGNVYKEKARVIAD 


13039 


26940 


A 


13178 


496 


678 


TLNFVWAQGLKITRPWKNAPVFPVIW\R 
LRRKNLLTLKGE I CSDPKWP YCLP PWKT 
KKNLV 


13040 


26941 


A 


13179 


192 


421 


GIYTFFLSFFLFLRQSLTLSPRAGVQWC 
NLCLPGSSDSPASASRVPG\ATGT\CHH 
GSANFLYFLVGDRVFGYVGPR 


13041 


26942 


A 


13180 


144 


5 


LQLAI KLLKMRLGWAHI CNPS / TLGGW 
G\GRSQGQEFETSLANIVKP 


13042 


26943 


A 


13181 


351 


51 


KKKKK I F FGTKKRRF F FLKG VWG P P P P K 
VSPPPPPKISPNPPKNNPLLKA/IKSPQ 
NWLFFFGPPSCKFFPPFFLNPKNSPWEN 
P P SGGEFS PKKKKKKSAI 


13043 


26944 


A 


13182 


2 


218 


IHFGCFVFLFFETEFCSCHPGCRAGVQW 
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RD\LAHNSLLPPPLG\SSDPPTSASQSA 
E ITGMSHHACP I FKKHS 


13044 


26945 


A 


13183 


33 


304 


E KHG I FRAVK I L CM I LQRWT YATHL S R P 
TGYTAPRVNPKLWTLGDNDVSM/SGSLI 
VTNTALGWGLL I RRS S ACMGQGDMGNL C 
TFLSILL 


13045 


26946 


A 


13184 


277 


13 


FS FFL I VEMGS C Y I AQAGLELLGSSNP P 
S S AS QS AGI TGMS HHP AWLATLKRAF VS 
D \NS L S F PS S ENAHL S I LAGHF CLMGF F 
FFQ 


13046 


26947 


A 


13185 


140 


361 


ITCCCVCIYTHIHTYVYVCVYIYTHTYI 
RVCV/YYIHTHTYIHIYIHVFIYIYIYT 
FMHT Y I YP YVCM VLLQLTL 


13047 


26948 


A 


13186 


64 


759 


CLSAESAPTSTMPMTLGYWDIRGLAHAI 
RLLLEYTDSSYVEKKYTLGDAPDYDRSQ 
WLNE KFKLGLD FPNLP YL I DGAHKI TQ S 
NAI LR Y I ARKHNL CGET E EE K I R VD I LE 
NQVM DNHME L VRL C YDPD FE KLKP KYLE 
ELPEKLKALTS\EFSGGKRPWF/SQGDK 
I TF VD FLAYDVLDMKRI FE P KCLDAFLN 
LKDFISRFEGLKKISAYMKSSQFLRGLL 
FGKSATWNSK 


13048 


26949 


A 


13187 


41 


426 


LFFFFFESGP C W VTQAGGRR / WNDHGS L 
QPGFPGLKGSPCLTLRRSWNYRPGMCRH 
TQLVFAFFFFPREKKFLLGAPTGFKTRG 
LRG P S RFGFQ KGGNKGQ E P P P RAKGL I F 
LEPERGQAFFGVFFGPTQ 


13049 


26950 


A 


13188 


39 


197 


FFFLFKTRSCSFIQAGVQWCEHSSLQPQ 
TPG\S S AAPAS AS CKAGT TEAALKL 


13050 


26951 


A 


13189 


221 


396 


GIQLQGAEGLWDVTNSPFCWRLYDQKTV 
YHECRMWANSHCP\PPKGLLRDITPRCW 
AP 


13051 


26952 


A 


13190 


2 


826 


PGS T I S S RRRGACG S RGGH F P S PRGGS G 
VASLERAESWSTEPAKAIKPIDRKSVHQ 
I CSGQWLSLSTAVKKIVENSLDAGATN 
IDLKLKDYGMDLIEVSGNGCGVEEENFK 
GL\TLSALKHHTSKIQEFADLTRVETFG 
FRGE ALS S LCAL S DVT I ST CHVS AKVGT 
RIiVFDHDGKI I QKTPYPHPRGTTVS VKQ 
LFSTLPVRHKEFQRNIKKKRACFPFAFC 
RDCQFLEGSPAMLPVQP\AKLTPRSTPP 
HPCSLEDNVITVFSSVKNGPGSSR 


13052 


26953 


A 


13191 


235 


1 


SPCARQCCPPNPAGQEPRRRLERGPGKW 
WPRS IKFPLPAV/RARFPLLPSAPLRQL 
VS GRVGGRVGR P GKAVQ VS GGLN 


13053 


26954 


A 


13192 


2 


386 


VIYLLLFFETESCCVAQARDGVSPCWPG 
WFQTPG\SSDLPASAPK/VAGITSVSHR 
TRPDVRGFKKC/GLQLEMSHLALLL/TT 
SHRFWYFVLLFSFVSKGFFIFFFFLFFF 
FFLGKGFFFFFPRGGGGGQI 


13054 


26955 


A 


13193 


749 


506 


GQTKAFVLSYCGASPSIKQ/PCPQAKDH 
PliEPSMHPEGTQLQSCSTMLGPRQLSSE 
KQPLLPPRSHLKSSPMLRACKGLTS 


13055 


26956 


A 


13194 


73 


297 


RMPGFGALGSPFSCQRIECPC\AGAGSC 
TYAS FCKCKEYKCTS CKKSECGAFPGNQ 
GAGQS QRRE PRAQQAGAGQ 


13056 


26957 


A 


13195 


670. 


390 


PRETYIKPFARSIGCQPPKTIHPTSRRP 
STRPPARPHARPPVHTP/APSVHMSDRP 
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STCPPSIH\RPPSTDVSRPQYSIiSPSTS 
LALPGQS CLGR 


13057 


26958 


A 


13196 


2 


438 


LSFTSI EFCHLL FLNQL I CNYVI FTKS S 
T I S CE ALGRLL WYP WTQRFFE S FGDLS 
T PDAVMGNP KVKAHGKKVLGAFS DGLAH 

Ij\PPLKPT f at vs e lhcaklh vd p enf t 

LjLjGNVL VC V / M KHN FGK 


13058 


26959 


A 


13197 


57 


499 


SPSWPRSSARPCSGHCLLPGLWDHGKAP 
CLRAARHS QGHG FRKGRKEGCGLWAAGW 
TKNPVPTWRAG/SRRGPGVTSPPPVPSV 
QNPP \ P PQRRHGGS QATRP VLFTPPPKH 
MEGPNAPILGDQRITAPPPEILYRPPRA 
ATCDALRRG 


13059 


26960 


A 


13198 


34 


465 


GILPYSPCVLRAPNSKRVRKCGCTTPRH 
L/CKATRKCGCTTPRAPLNETRKCGCTT 
PRHL S RQ LGNAGAL PRGHLNE TRKCG C T 
T PRAP LNATRKCGCS TPRHLSRQLGNAG 
ALPRGHPLKETRKCGGATPADAFYETKK 
IGGGL 


13060 


26961 


A 


13200 


177 


1075 


PTS S SMAFPKKKLQGLiVAAT I TPMTENG 
E INFS VIGQYVDYLVKEQGVKNI FVNGT 
TGEGLSLSVSERRQVAEEWVTKGKDKLD 
QVI IHVG AtiS LKE S QELAQHAAE IGADG 
I AVIAPFFLKPWTKD I LINFLKE VAAAA 
PALPFYYYHI PALTGVKIRAEELLDGI L 
DKIPTFQGLKFSDTDLLDFGQCVDQNRQ 
Q QFAFL FGVDE Q LL S AD VMGATGAVGS T 
YNYLGKKTNQI V\ EAFEQKDFSLALNYQ 
FCIQRFINFWKLGFGVSQTKAIMTLVS 
GDSNGPTPASTCRKPPGEFT 


13061 


26962 


A 


13201 


35 


464 


VQEFKTSLGNVAKPCLYKKKKKKSPLGG 
GVP P P P KKTWVGGTIiS PQKIRP AGPLI F 
P PPLPRGQQGE PLSPI KGGGGGGS S \ YP 
PPPPPCLTGRA/PPPSPISAKSSPSPPP 
CQYVYHPPLLPLFFFLHALRRATAPLFL 
FLA/RA 


13062 


26963 


A 


13202 


1153 


259 


AGGVLRLG WTGS RMAS DS GNQGTLCTL 
EFAVQMTCQSCVDAVRKSLQGVAGVQDV 
EVHLEDQMVTiVHTTLPSQEVQALL\EGT 
GRQ P VL KGMG\TGQ F S E SGGQ P VA I PGG 
GLiGTVQGWRFLQLTPERCLIEGTl\PG 
LE PGLHGLHVHQ YGDLTNNCNS CGNHFN 
PDGAS \HGGPQDSDRH\RGDLG\NVGAN 
AAGGAFFRMEDEQLKVWDVIGRSL \ I ID 
EG\EDDPGAREGQSLYPKITRELPGSRL 
SCGI\IAKSAGLFPEPQSKICSCDGLT\ 
IWEERGRPIAGKG\RKESAQPPAHL 


13063 


26964 


A 


13203 


281 


420 


VDGSKKYNEMPVI PALWEAKAGGLIi \EP 
RKSRTAWATREDSVSTKN 


13064 


26965 


A 


13204 


3 


1125 


SD S PQTPRMRVMAPRTL I LLL S GALALT 
ETWACSHSMRYFYTAVSRPGRGEPRFIA 
VGYVDDTQFVRFDSDAASPRGEPRAP\W 
VEQEGPEYWDRETQKYKRQAQTDRVSLR 
NLRGY YNQS E AGSHTLQWMYGCDLGPDG 
RLLRGYDQSAYDGKDYIALNEHLRSCTA 
ADTAAQ I TQRKWEAARAAEQWRAYLEGT 
CVEWLRRYLENGKETLQRAEHPKTHVTH 
HPVSDHEATLRCWALGFYPAEITLTWQR 
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DGEDQTQDTELVETRPAGDGTFQKWAAV 
WPSGEEQRYTCHVQHEGLPEPLTLRWE 
PSSQPTIP I VG I VAGLaAVLAVIiAVLGAV 
MAWMCRRKS SGGKGGS CSQAAS SWSAQ 
GSDESL IACKA 


13065 


26966 


A 


13205 


417 


13 


GHPRVMLQCHMGSAS LS IMTPP P IRSYQ 
P ALTRS S ATKP A\ TS AP S I TPMT P VTAN 
I S / VWPTANTTTTKLSNPTSS / STLSTP 
IWHPSILPAPPSTPAPLTTVTNSATTST 
I CFHVQLFQAHHCHHC IAAALEYTS 


13066 


26967 


A 


13206 


1139 


559 


GLSELSIHPSVARVQEGHGATPERSIPW 
PGGKPRRGGRGTGLREWGGGSHRSLGQT 
KKKGAKAKRDHCAHGP IAAFQGPFPRAT 
QLL,SPCHGCSWKENQPP*GPPGHACTAW 
E KTVTHGS CRP VTKP S PDPGFVGQAPGT 
KGNPAPGMDGCAGFRI SCHPVP I TKPLS 
NTGICVLLCYEFGFLIAIVGCWGFK 


13067 


26968 


A 


13207 


306 


332 


ENGEIRIYIYIGLYLIKKHWEEAGCGSV 
HL * SQHLKRLKWEDYLRPGVPDQPGQHR 
ETPSQKLNK* INNDF 


13068 


26969 


A 


13208 


48 


233 


GKQNHLKCLWCRLLFLIiSAVISRNVYIL 
T * LSLS LS QCVCVC VCVCVCVC VCVAI L 
ENYPR 


13069 


26970 


A 


13209 


250 


2 


YCEGDLGS SRPGVS KCREFS PRRDKRIM 
PWPGAVTYAYNPSTLGGRDGWIA*A*EF 
ETS LGNI VKPHLYTHTHTHTHTHTHT 


13070 


26971 


A 


13210 


615 


120 


SVFWPLLGGVSWSGYTEVRGPLEKAVCP 
LSELEYCAGRSAALFRAIRLFKDALSLL 
KLCPQPP FPLGAPLS P * EALVCWPLHV 
STCLS*WCPTPCPPERSWCVLFPSMYPR 
VCPDGLLPPVPLRGPGVCWPLHVSMCL 
LSLFNSLIiRLRTCGVWFSVPVLVC 


13071 


26972 


A 


13211 


239 


429 


FILLRNKLQPGTVACGCNPSTLGSQGKW 
I A * AQG F E T S LDNM VKP C F YL P QL KKKK 
KIKNLGV 


13072 


26973 


A 


13212 


954 


503 


VQLLPLCGRDLYLFAFNLPTYRCWQHFL 
SKPRAPILAGGSDRWPCPCCPFWLP*WC 
HVSAHAPQPFPCAKPEAPLPVGCPCSPA 
PSLEAALLILRAGLWLiRAPFCFSAHLL 
SCHSQCCQFPCSEKRSCLGGRDRHAASN 
PLSAVWPPGE 


13073 


26974 


A 


13213 


1844 


1147 


LHS Q I YS TAKKASLSMKGS RDKTRAAS S 
RPVPSVLGVPPWSTLLQHPQNMWPGPAQ 
QQGQPSGRQAWCTPGEAPGAEAAPQ*QP 
HPEEDHSGGPQASAALALPPSPPSQRDV 
QGETGMQGRSAPRSAS SSACCACRWSRL 
P CP QLLQRHPGIjRLjVS PAHS RP P G PAP S 
SSSGSGLVPGYLPQKGLAGLSAGAVLCP 
PGLLRVAHGAGYGP S AQMLHARLS S SSL 
RGKRFLRF 


13074 


26975 


A 


13214 


413 


60 


LCSKSCHGSFAMDCETiiLTFPRIiFYPIL 
HMPTVPGQALPVHTPFL.*LKLPLLPGIP 
SCHSPLFLD*AQESPPFGGLPGHPPGSG 
VLCLWFHSLKHPVLSGSSVSTPGSASPS 
RSRAL ; 


13075 


26976 


A 


13215 


67 


196 


GL S LS S P KVTHNATLMG C I FQN * KAF VY 
HSMKQKKI I LLFNMA* P * YPLDFGEQWP 
LHGSHAYSTIL*LDLFCKKEEE*DEIPY 



1339 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, 1— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
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V*CFMLLWKSTTM*KKI ILLFNMA 


13076 


26977 


A 


13216 


161 


2 


S VFFLFIDRRLYL I KS WAAYNSSTLGG 
RGRR I T * AQE FKT S LGNKARPRVYQ 


13077 


26978 


A 


13217 


57 


184 


CLTVCKDKVS L CYPG WS P W * S YLTAAS 
NSWSKAILPPQPPE 


13078 


26979 


A 


13218 


292 


400 


LKVKQGRGHLDLNQGPLDL* SNSLPLSH 
TPSHRRTV 


13079 


26980 


A 


13219 


186 


394 


YLVLS FLFYFLFI FL.SRVSLP I FFILE I 
*LGIVAYTCNPSTLGSQGGRIA*AQEFD 
TSLGNIVRPPSLI 


13080 


26981 


A 


13220 


315 


448 


KT KD LGVVDHV Y3STP S TFGG * GGR I T * GQ 
KFETSLDNRARSCLYK 


13081 


26982 


A 


13221 


3 


13 


LR P LRS F VS VERLWAS C CLGS E L WD KT 
KRKKRRELSEEHKQEIKDAFELFDTDKD 
E VI D YHELKVAM I ALGFDVKKADVLNI L 
KDYDR*VIGKISFGHSIEIVTDFQL*R 


13082 


26983 


A 


13222 


1 


715 


VDMHS RGS PELTHAS THAS GKMAAP WAS 
LRLVAPMWNGRIRG IHRLGAAVAPEGNQ 
KKKRTILQFLTNYFYDVEALRDYLLQRE 
M YKVHE KNRS YT WIjE KQHGP YGAGAF F I 
LKQGGAVKFRDKEWIRPDKYGHFSQEFW 
NFCEVPVEAVDAGDCDINYEGLDNLLRL 
KELQSLSLQRCCHVDDWCLSRLYPLADF 
VAGAFRWAG C PR I FRTGAKHGFHHLQ*D 
LSSGWATCPGTSPT 


13083 


26984 


A 


13223 


1019 


1238 


LTGAE YKKSD GLT E S VTANLE P E AP KVF 
SRLDDEAPVTVLPARLPLPAP *HTARPD 
PPQRRRSMQLAPARPFL 


13084 


26985 


A 


13224 


1534 


1181 


RWNSEPDYSAGGPRAGRGARLLPVPKSP 
GGGGAVAPGPQTPLRAERTAVRLS ISS * 
NS S TAP ADPGAAP PLHPQ P AE AGT W VS A 
FLRVI FCFPSLGFGCWEKKI AFVPFPPT 
DEKCL 


13085 


26986 


A 


13225 


328 


22 


CLES PRDRPFQGPAFPKHPPPPLAFLSQ 
PGKVQYFFESNCKSL.SSQEIKNSRSVPT 
SPPAPSPPPPPFLYFSLTAEGGEV*KET 
LDSDRAPPSSREAP I CQV 


13086 


26987 


A 


13226 


180 


5 


NSIDYKINVKNSNRPGAVAHTCNPSTLG 
SQGRRIT*AKEFETSLGNIVLLIYTIIY 
YI 


13087 


26988 


A 


13227 


237 


528 


VGL C QD P L LLT LTD S F CS VLWGG S HLAF 
HKNLYVVHIDLVMYFSRIS FYFTLSAAI 
CLSLYLGSHILSPFL^LFYCAYTTLYLC 
LITTLFFLIPKV 


13088 . 


26989 


A 


13228 


140 


366 


NATCLWHPMS PYAPTY* C * LS PGLLRWV 
LAITP I VLMIL I SSYNHVS I S YGGTFRS 
NFFLDQI ILTFLI YELHIV 


13089 


26990 


A 


13229 


257 


25 


LPAHSLGRDLSAQPYSMPRPGGELRAEG 
QSSLCSVARSSL*LHVCVCVCVCVCART 
CVNVFACVLLSTKVCLHPLPE 


13090 


26991 


A 


13230 


561 


52 


WPSCSSSGSPMLLPPVPPGSLG* *SFSP 
LPAAAALPAPCAAPRPAPLRPCGPAPTP 
AAAPAP AP PAAAS L S AARAAAGS P P GS R 
P S ARGARRRPS G P CAP PEP RLGPRAPS S 
APRTRT PS* GRAR S GG S AGNAP S ARRT P 
QGP PRAACS LARS W I WPAGRGGS CGRAL 
GA 


13091 


26992 


A 


13231 


335 


103 


CQKYSTSYCNKWIiATLVIICSRKI *DGL 
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KGQAWWLVPVI PMLWKAEAVGLPEARSL 
SSPCNMARAPALKKIFLIGQV 


13092 


26993 


A 


13232 


452 


712 


VGLEGED*PPFSPAQSQSLSCWPDCSCP 
PSWPPDQRPGERPQPPDPPESHGAEEEG 
LPERPSDA*ARPQPFSLHPGVFPPQNAS 
TQ 


13093 


. 26994 


A 


13233 


480 


2 


SCFSEDWNPLKFKLQLSTSLSYRKSHTL 
HCAHHFRGQCHSD*AFLVFSLTCILQVG 
S LR F TAGD VS P S Y P E KLRVL * E GRVAD F 
HVMARKGQQAALAILLQNARTGYWWLT 
PVI P ALiW KAKAS GS VKPR S LRQ TWQNGE 
TRLLDRGESASQSAGMTGM 


13094 


26995 


A 


13234 


692 


824 


QIMALPFPSPPVPPLVM*CKGGKIPPPP 
SKNPRTCVGN TVI I KA 


13095 


26996 


A 


13235 


657 


872 


PRGPRLDVLYACE PQL I TCLELGTPLKV 
RFWLEVVDHTYNPSTLGDQGGRTA*GQE 
FKTSVGNSKTLSLKKK 


13096 


26997 


A 


13236 


168 


365 


QESLCILQTCTCAHQNLK*KFGGKSRLG 
AVAHACNPSTLGGQGGRI I *GQEFETSM 
VNMVKPYLY 


13097 


26998 


A 


13237 


31 


338 


WYIVYICVCVCVCIYIYEIYMIYI*YI* 
Y I Y I KHT VQ * H * HLAFS TFTLLCNCHHC 
PSLELFHYPTLTYHTH*IITPHCSSC*Y 
QPLFYFHLYVFDYSRYLV 


13098 


26999 


A 


13238 


714 


1001 


SSGVS CLNRAGW I TGVNHHI HS LVLLKR 
EK*TQNIITEVCLMFLLLPHLAIEATVT 
WPGMVAHACNPSTLGSQGRRSV*AREFE 
FSLDNIARPCL 


13099 


27000 


A 


13239 


250 


406 


NLA* HGGVHMWS SAS QVAGI TGTRCHAQ 
LIFIFLVKMGFHHVGQDGLNLLTL 


13100 


27001 


A 


13240 


256 


193 


WINFISLPLSSLT*TLSLCVCACVYVCV 
CVCLRPCVHIVGGNLKRYIVFARSVTLG 
CKFTHLHISLIINNDT 


13101 


27002 


A 


13241 


1 


257 


GMDLWQMCHSLSLPLIFFFFFFSEKTRS 
NFVPQAGFQLRALRDPPA*ASREAGITG 
LTHHVRPGQIFKKIGTLKPTILLPLLPR 
K 


13102 


27003 


A 


13242 


319 


406 


KRGWT * WLMP VI PTLWEAE AGGS PDVRS 
L 


13103 


27004 


A 


13243 


228 


56 


KGAPPVLKPGYPKAAKNPTPFKPPGSKV 
GENPPLF*NPCPGGGNPFPTFFFFFFFF 
F 


13104 


27005 


A 


13244 


1687 


612 


I LGNQ C CKFDAYNALtANE S TM I S I KLNE 
*VREREREKEKEKERVRERKEEREKKKL 
TSSKGTGSTATFHV 


13105 


27006 


A 


13245 


220 


1 


TKDPSSTMPPQPNPLLSFKSQFIFLIFL 
F I KKYVGWMQWLL P V I PAL * EAE AGGS L 
QPRS S RPAWATWRNP I F 


13106 


27007 


A 


13246 


204 


375 


ASWPPGLQYELRI I KCLLHCWTRAMIFR 
ERERRERERERERERERERE * VHLKRKG 
A 


13107 


27008 


A 


13247 


158 


2 


I W VF KE T F FFQNF F F F FGKENNFNGF F F 
FFFFFFFFF*DRVSLTLHPSLGYR 


13108 


27009 


A 


13248 


221 


3 


NPSQGFPLLKIFFFFLRWSLALSPRLE* 
NS I * KN* KI * KI S WAWWRTPWPGAWEA 
EPGESLEP GRQRLQNE 


13109 


27010 


A 


13249 


81 


341 


GELNDT IHVKYLG* YI KVL * RNRTNRVC 
VCVCVCVCVCVYACTQKDLF*GTGSCNC 
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K=Lysine, L=Leucine, M=Methionine, 
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GAL VR P KS ARQ AS RL KTQ KTWI RS L KA 
VG 


13110 


27011 


A 


13250 


299 


1 


ATARLRFLHGAWWYAS VVP PTQEAEARG 
FLDPRSSRME*AMILPRCTPASVTDKIP 
SLASQIHRPSRVACVILLSSHAPTEAPS 
RWPPPSPLCPAGSRA 


13111 


27012 


A 


13251 


106 


146 


LCVLKQVKQATWTKYS SSQI FAI LDCMT 
TLS SHPALKYCNYVL ISNQS PNHLRL * C 
SGMISAHCNLCLPG*SNSPASASRVAVC 
FKASQASYLD 


13112 


27013 


A 


13252 


217 


158 


LFEIFCIPFFFFFFNFFFETRSHSVSQA 
GVQWCKQGSLQP*IPINFFWGKEGVIDK 
LILISYGNAQGFKVAVTPVCTTALQPGR 
QSETLSQKRN 


13113 


27014 


A 


13253 


150 


56 


TILILLLLYETESQSVAQDGVQWCDLGS 
IQSPPRSS CHS P A* AG VNRLRQ ENGM S F 
EAEIVLSPDRTTALHPGLQIETLSHII I 
IILILSTISFHQLLH 


13114 


27015 


A 


13254 


174 


145 


PCLKMTTYDFKSALEIAFFCFTAILRYK 
LL I KVTHFKWLRVAAHDCNPNTLGS GGG 
RITWGHEFKTSLANMA* PTTRS YSQRVG 
IMGITIPDEVWSRTQPSHIRSGPSKVYP 


13115 


27016 


A 


13255 


179 


273 


GQWLTPVI SAL*EVE VGGLLETRSLKQA 
WAT 


13116 


27017 


A 


13256 


264 


382 


GLKIGKGCSWWFTP VI PTL* EVKARGLL 
EPRSLKSAWAK 


13117 


27018 


A 


13257 


1199 


1563 


YLWGPAVPPLDLEPGQAGATCGPVSHY 
KIFAKLWAVGSILDLMPGPE*GSFLSVF 
TSLCFIYGVTYPFHSEQPHPASPSASGL 
LLLKGRAAWLVSTLCIGSPAPMLTGIFP 
HKVYNQLFI 


13118 


27019 


A 


13258 


194 


242 


I L YVE TGS HYVAQAGLE LLG S GN P PAS T 
SE I AG I EA* ADE F I YVGVS WQDE CIKSM 
KQVAHAS I PAI SE VEAGGLP E PRS S RPA 


13119 


27020 


A 


13259 


322 


120 


GPTP VLDAFQKLCLS FTSGQQADG 1 1 PA 
FPIRKQGGTQWPKPVIPTLWEAKAGRFL 
* PRSFRPAWAT 


13120 


27021 


A 


13260 


237 


1 


TSKKISLGWWGKQGI PGTWEG*AKRPFE 
PGKPRVQWTQVPALDFSLGGKARLCLKK 
KKKKQKQKTLKLCTHSRITYSRA 


13121 


27022 


A 


13261 


52 


318 


S AVG I HRCDDGSH * P * TPEHKQLS FLSL 
PSSWDYRGITRELFQRFPWIFLQLITAV 
I S SES TVLKNLEIAAVRGSHVRVI MMAV 
PINPF 


13122 


27023 


A 


13262 


274 


1 


KKKKNSPVWWWTPVIPGSPG*AGELLEP 
WRQKVQLAQWPLAKWPQTKWQGCPL 
ALQPGEHQG I FVS QKKKNPKTKRKYWAL 
FCSLPSC 


13123 


27024 


A 


13263 


717 


877 


NSTKEMAHWPGWAHTCNPSTFGG*GGW 
I T * AHE FETS LPNMVKPHL YKKYKK 


13124 


27025 


A 


13264 


288 


131 


SLYIWHSKRLITITNNKISGVWWCLPW 
SSTWEVEAGGSIjEPRR*RPAWATK 


13125 


27026 


A 


13265 


80 


287 


FMNGEAS * KTS ILSQAWWCAPIVPAAQ* 
NEAGELLEPRSEWLVWATRALRI S SRGL 
RFRFRLRRFTSTR 


13126 


27027 


A 


13266 


3 


241 


VGLFLFFFETESCSVTQAGVQWCGLS*L 
* PP P PG I RD S PAS AS Q VAGTTGTHHHT W 
LIFLYF* *SSGFHYVGLGRSSNS 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
\y— Tryptophan, Y=Tyrosine, 
X=Un known, "=Stop codon, /^possible 
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13127 


27028 


A 


13267 


103 


359 


I CVYVY VCI YVYEMRATSKDFLWQVWFS 
FLRTVAVLFEEYLIVL*MWLSFHIHIYI 
HTHIHIFIYVYTHVIilYVYIHTYSYMCI 
HIYI 


13128 


27029 


A 


13268 


172 


12 


SARSSQAIQIQKS VTSGQVKPQW*QS * P 
TPFLPTLIALTTLLLPISPFILIIL 


13129 


27030 


A 


13269 


2 


203 


R I EN I RS YKNLAAIi FMTAKKMGT I QM PH 
NFEWINKMWYIHTLEH*SDKKKKKKKW 
RAEIPKLIIQK 


13130 


27031 


A 


13270 


236 


365 


KIYRQGAVAHTCNPNTLGGQDGWIT*GQ 
KFETRLANMVKPHLY 


13131 


27032 


A 


13271 


371 


15 


QNQINFALIIjIINTLLALLLIIITF*IjP 
QLNG Y IEKSTPYE CGFDP I S PAR VP F S I 
KFFLVAI TFLLFDLE I ALLLPLP * ALQT 
TNDPLIVMSSLLLI I ILALSLAYE*LQK 
GLD*AE 


13132 


27033 


A 


13272 


2 


16 


PRVRTS SRSRAALEL I FFFFLGPPNLP V 
YNGPLGRTKPGTGELDTGGSPILCVGQG 
RHP YWKGGAKPLAP FGKGGGP * GTRPLA 
GPIVPPKAGIiQSEMPGMAPFCGSFG*PT 
RPD 


13133 


27034 


A 


13273 


329 


290 


SSALVPRLECKGIFSAHCNLCLMGSSNS 
P T S AS RVAG I TG VAG QATD KT P QTLS * R 
SQPSHMDPVRWSP 


13134 


27035 


A 


13274 


175 


13 


APCDHRPCPPENNPL*L*FSITFPNPIK 
RPHP YLPLLTLFSDSAHLHPGE I EQR 


13135 


27036 


A 


13275 


355 


861 


PIjTTTPAAPRAPCPPSRLSGQPLTGPTE 
GSRSRLSPNISEQGEPPLALTVGHPLST 
QPGPTVPSELEPIQGPRG*GDCPTPSQS 
A*GGVLSCTPESHTEFKPPPTGGGRRWA 
RLGLNGAT * GREE PLQTRLPAE YPGPGP 
IDPLQPPPISTASMATAFSDFLLLGRDP 
A 


13136 


27037 


A 


13276 


865 


667 


KFLiCILLDFLFFIF*EMRSQHRLECSCA 
I IAHCSLKLIiASTDPPTSAS * IAGFTGM 
CHCAQLFLTF 


13137 


27038 


A 


13277 


103 


375 


WS RRL P WRRGLG Y I E L FQGLE I RHHFL F 
GPHYLRRTQCQGPVI PSELDGQGWDYMS 
PGV*DQPGQHDETPSLQKI*KTSSAWWH 
APWPAT 


13138 


27039 


A 


13278 


177 


3 


QQTEGSRAHSMSHRRPGAWHASNPNTL 
GGRGG*NI * SQKFKTSLVNMVKPCLYGR 
V 


13139 


27040 


A 


13279 


379 


372 


SR * WVCMVAHACD P S I LGGQGGR I T * AQ 
EFETSLRNMVRPCLCLGNTNIYIYIHTH 
TYIFIQN 


13140 


27041 


A 


13280 


83 


1184 


PPAHAARASPPSYTWLCYEVKIKRGRSN 
LLWDTGVFRGPVLPKRQSNHRQEVSSWE 
CRKHISKMSGGGLSTVYFFHRRFQITWF 
VSWNPCLPCWKVTKFIiAEHPNVTLTIS 
AARLYYYRDRDWRWVLLRLHKAGARVKI 
MDYEGERCRGQGSMTGRNSIiRDGWI CNA 
INYASLHRTLKEILR*GSPSGLIVSLLS 
PPAHPPEDSPGNESWLCFTMEVTKHHSA 
VFRKRGVFRNQVAPKS YLHPK* ELS S WE 
RRKHNT * HTNYEVTWYTSWS P CPE CAGE 
VAE FLARHSNVNLT I FTARkC YF WDTD Y 
QEGLCSLSQEGASVKIMGYKDFVSCWKN 
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FVYSDDEPFKPWKGIiQTNFRLLKRRLRE 
ILQ 


13141 


27042 


A 


13282 


1558 


1824 


SFPYLAS FPFCLGLPLETLFHPLGL * KD 
*SDTHTHTHTHTHTHSHTHAHFPSFPDP 
LFQSSPFSSGFIDEYKYPHLWPVMSVTC 
CRFCV 


13142 


27043 


A 


13283 


1 


519 


PHFFLPQGFWGFFSPFPL*KSSSPLKAL 
IFLGGFSPFFPPPKKRFFSKIPRGVFFP 
PPKGKKFFFPPPG* IWPPQGFFLKGPPS 
SSSSSSSSSSSSSSSSSSSSSS 


13143 


27044 


A 


13284 


349 


159 


CPLEPKKNYPFCFFF*ETGSCYVA*AGL 
VLVAS SNPPAS AS Q I AG I QGTSRHAQPC 
RIFLSKT 


13144 


27045 


A 


13285 


748 


869 


WLG WAHACNP S S LGGRGGQ IV* PQEFE 
TSRGNMARPQSL 


13145 


27046 


A 


13286 


315 


410 


VQWFT P VNS AL WE S E AGGLLE I RS * RTT 
WAT 


13146 


27047 


A 


13287 


355 


499 


ARCGGMHL YS Q I FGRLRWEHCLR* KVLS 
QVWWHAPVFAD I WEVEVGALLEARSLRL 
* CAMLLP VNSHGP PTWAMQ 


13147 


27048 


A 


13288 


880 


701 


KSYFSSHLHFEGKKCVSS ILD* S ILVLM 
CSWLMNYTHTHTYI YIYTRTHI CVHVYV 
HNF 


13148 


27049 


A 


13289 


379 


434 


KRGPPPPKKRGCFFSEKFF*GPPKTPFF 
FFFPKKKKKKKKKKKKKKKKKKGGRSRS 
RLADAWADAW 


13149 


27050 


A 


13290 


253 


344 


GLFYNLQKIKIFYVKNLFFFFSNSITEA 
GVQWPNLGSLQPPPSGSNDSPASE*RIV 
V 


13150 


27051 


A 


13291 


561 


745 


AWEPSLVGETNVNS FNQKYINWPGAVAH 
TYNAGTLGGQGGWI T * GQE FE TT LANMV 
KPSPY 


13151 


27052 


A 


13292 


289 


2 


TLPQGEDFNKFVFGS I KQKLQ INL*NTD 
FIGNVFQSWAQQYTPWPPSWVAEVEGS 
LAARS S RPLCT 1 1 T PLNSHCS LAWATQQ 
DPAGRVGRPRV 


13152 


27053 


A 


13293 


281 


229 


LKNKNVNKEKLRQGWAHACNPS ILGGR 
GGWIT*GEM 


13153 


27054 


A 


13294 


276 


441 


GLFPKFLIQKINQNWPDAWHAYNPSTL 
GVQGRW IT* GQ * FKTS LANRGRKS ENK 


13154 


27055 


A 


13295 


258 | 


389 


NM VE KRLGQ AHAYNL S TLRGQGER I T * A 
WE FETSLGNWRPCI 


13155 


27056 


A 


13296 


358 


353 


S FLAFFNGAFLPLRGFFWKTFFFGGVCW 
RRPPL * RKKKKKKKKKKKKKKKKKKKKE 
MFKRNIGGEERGGGG 


13156 


27057 


A 


13297 


196 


1 


LMGKAPFLGGFIPPAFFFFLKKKKGGPG 
AVAYTCNLSTLGGRGRWIT*GQEFETSL 
ANIAKPCSC 


13157 


27058 


A 


13298 


182 


436 


GVTILNVRHRHRKKVTLYRDLKKVRKCP 
KLHGNLRKVFQAEG IASSKTLKGHVWWL 
MP VI PAL * E AKMEGLLE ARS LRSAWATQ 


13158 


27059 


A 


13299 


334 


440 


RHYWLNF * AF YHS S LAPAPQGGGHWPPP 
GIT PLNS LHVPLLNT S A* LAS GV* LT * A 
HHRLVPimRALi IQALRITRILGLYSTL 
L PA* KNFEAP FT I S DGVYGSAFF * ATGC 
HGLNV 1 1 GS T FLT I C F I RQL I FHCTS KQ 
*LWLEPPAWDW 


13159 


27060 


A 


13300 


359 


372 


KKKKICGGKKNPPNKKKVKPRGEKTPLK 
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GV* PFFCLEGFFS PHKFFFFFI FSPNI C 
SPKKSFFLKFPPALFFFNLPFLKNIPSP 
PFLKKASF*EIFFCGPPPFFFFFQNTLF 
FFPPFFFFFFFFFFFFFFLDKFAVY 


13160 


27061 


A 


13301 


233 


406 


VTVEGMEEVLAGSSEGMGSRPTS*V*FL 
KYLLGAVAHAHKPSTLGGRGGRIT*AQE 
LKTSLCNKVRPRLYKINK 


13161 


27062 


A 


13302 


92 


429 


GSHFNSELIEMKFYIIKRNIFIILNCFY 
LIKHQHFHYTKFLLVSVINKF*LATVAH 
AYNLNTLGGQGGRSA*THEFKTSLGNTV 
RSCLYKNLKMSWWWCTAIWEAEVRGLP 


13162 


27063 


A 


13303 


302 


140 


GGAVNSVQSQAWWHMPWPVPWEAETGL 
LKPRSLSL*CPMIVHVNSHCTPAWAT 


13163 


27064 


A 


13304 


115 


312 


LRSPSFVLSPRLKCGGTIMVHCSFDLLG 
* RNP PALP S KWET IGLHHHALLRLL I F 
FF * KWWLFF 


13164 


27065 


A 


13305 


90 


246 


PGWAHACNP S TLGGRGGR I T * GQE F KS 
SLCKIAPPPERKEGRKEGGKKKERG 


13165 


27066 


A 


13306 


19 


310 


DSSFSPIMCFGMSVNKTGLWLGKKKKKK 
KKKKGGPP*KKPLGGPNLPGGGKKKFFP 
* RGAKKN P PGD F WKKTL FLGGE KLG P P P 
PQK*JSfPFFGGGKIGPTPPPKIKPFGEKK 
KF 


13166 


27067 


A 


13307 


179 


56 


WVLVYKKQSILG*AQWVMSVISALWEAE 
AGGSLVSTSLRLA 


13167 


27068 


A 


13308 


425 


290 


KKNIYLAPPGYFWPPQRFFLSPPPPPNV 
VIFFFFLFFFFFLFFFFFFLIQYAEGIG 
VMRGRGEGKMGR * W * KF FGRRVLN I RS I 
LLANFKHWPYW 


13168 


27069 


A 


13309 


214 


371 


IYF* * RW I SQAWWLMPVI PKLWEAEAGG 
*LEPMSLPGQYNETSPL 


13169 


27070 


A 


13310 


975 


572 


AWGMWGVGKHS LVS LGIEE CQASTAL S 
LDKSGWWEEAARREDVLCEDRRCLLCHV 
PAGVRGSLKPELGSRKGKNGQQSGSKPS 
VPSLGPRPQGPGNPALSKGTRPN*AICF 
LCQTPADHS AKKQAPHTL I P IR 


13170 


27071 


A 


13311 


270 


10 


TPNKS LI* RLQFSNI KLANLGLGTMAHS 
CNLNT LGGQGAR ITS GE E F KTGLGN I D P 
IPIESKVI KTLAGMVS PDAWVDAI KRTG 
TSG 


13171 


27072 


A 


13312 


103 


290 


LTEWSGVIL*NCIYLLPQYTSTRVHAY 
I HT Y I HTHTHT S GL S S T S VG S TNHKS K I 
FGGKKG 


13172 


27073 


A 


13313 


136 


1 


SNTSSSLVTQAGGQWCELGSL*PLPPKF 
KRFSCLSLPSSWDYRRL 


13173 


27074 


A 


13314 


88 


183 


RVSTLLKKSCFFFFFFFFFFFFFFFFFF 
FFFFSSQKKKNLKNKKRE*KKSCFFFFF 
FFFFFFFFFFFFFFFFF 


13174 


27075 


A 


13315 


481 


517 


LLS P * R P KGKP KG KKRKPNL KE KKI GE A 
RGGKERQRKTPQRTGKQKKRRKYRRIGR 
RNYR 


13175 


27076 


A 


13316 


307 


196 


GMHLYLPRTW*WVRVIPGSREIEVHAPL 
EPGSQRLLVAEMTPMHSSLDNMNKSPFP 
QPPKTEKQTKNPPKPHIIA 


13176 


27077 


A 


13317 


2 


209 


PARALDLKGSPYME S CSDAQAGVQDS I Y 
GDHL*LRAPAV* *GQTVFVASPSKVGGI 
TGASHHPGLFFLF 


13177 


27078 


A 


13318 


3 


178 


S F C F S VLRE 1 1 GQS L I MKTL P * I THTHT 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K==Lysine, L=Leucine, M=Methion!ne, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T==Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














HTHTHTHTHTQRE I ERERDRERGESQVL 
GC 


13178 


27079 


A 


13319 


40 


382 


DS VP S L.DLKLLME TGPHHWQAS TTT PS 
* FFILEGFHHVAHAGLKLPS S SRPPHLA 
S QS AGI YGCE PLRS ARL I IN YIGSRL WG 
LRNI CLSQDHEECLLCFILQAI SMYRFC 
LG 


13179 


27080 


A 


13320 


107 


1 


P PL I MANS GWAQ WLT PWP TLWE AKAGG 
*SEVRSS 


13180 


27081 


A 


13321 


111 


382 


ASSQLLELAIQVFPGIWIPPTTKQKGVP 
SGKTLVTS KI I AGLKMPKDSRSRPRTVA 
HTYNSSTLRGQDRRIT*AQEFKTSLGKI 
GRPCLF 


13181 


27082 


A 


13322 


49 


108 


CGLY* F F F * AT RS WH VAQ AGL E F LGLS S 
FPASASSVAGTTGMCHHTQLHY* F*V 


13182 


27083 


A 


13323 


1 


257 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERARERE*ECV*EGETVSDRERDFERA 
P PRVLGAVAI Y I EWCLAS L I Q 


13183 


27084 


A 


13324 


1 


248 


EFQYILCSALIFGRSKILK*ATKEVKKS 
KHIPRI SCQDTRKYNWSAKAKRRNTTGT 
GRMRHIjK I VYRRFRHGLHE ENL FL KH 


13184 


27085 


A 


13326 


375 


1 


P PKRRGKTHCS P PKVFPP PQNN * TTP P P 
QLIICYIa*RGGVFFSLPLII*APPAFFF 
FFQFFFFFFFFFFFFFFLQGQYWQFRDT 
DAEGHLASSRPSGAVFPSQGMHTTGSTR 
RWGQCVPFPPTRP 


13185 


27086 


A 


13327 


382 


15 


NWPKNCPLiHFQNWFLRKTPQIFFCLKNF 
FFFFPKKVFYPPKKPFLEKPWPPVLN*K 
KPPPPFFFGWGPGGPQYGPFKKAPPLFI 
RGKERFPLWGKGDFFQIPCPPGPLKKKK 
RAAARDLGTS 


13186 


27087 


A 


13328 


2 


220 


GRVGSSRARAVALFFFFFFFFCFFFFFF 
QNPPLKRGEKKKNTPPLKKKRPLRGGFK 
KQKE C * E KKKKLSGP K 


13187 


27088 


A 


13329 


427 


289 


IQLWSSL* K* LHTHTHTHTHTHTNTCFL 
T VMKE L S THPGNK 


13188 


27089 


A 


13330 


203 


269 


INFGPPREPGGFNQKPQFPSAPGFEPWN 
PPQGPKP*KKKKRKKPFKVWAPGGGSKV 
QNPGLRETGVFG 


13189 


27090 


A 


13331 


216 


418 


D VP VH Y CRL YKP C TLRLHS I Y KKQ I KL W 
PGAVAHACNPTTLGGRGLWI T * GQE FE A 
S LANWKPHL Y 


13190 


27091 


A 


13332 


160 


325 


WERQLFKI AQSGLARWLTPVI PALWE AE 
EGGLFEYTSL»R*LWATQQDPISTKMFK 


13191 


27092 


A 


13333 


339 


62 


GMLPLFVPPQKRGSPPYPCYGVYNSPPL 
KKQRFFSSLGIVLPPIVFITPPPPAFFF 
FFFCFFFFFFFFFFFFFLI *RRLMRRRM 
FSCFLHCP 


13192 


27093 


A 


13334 


45 


384 


DPSVRINTLLALLLIRITF*LPQIiNGYI 
EKSTPYECGFDPI S PARVPFS IKFFLGA 
I TFLLFDLE I ALLLPLP * ALQTTNLPL I 
GMASLLLIIILALSIiAYE*LQKGLH*AE 


13193 


27094 


A 


13335 


270 


660 


AGSRRPLRVPGFSLSMMSPSLCRPVCVA 
QCIRVSFLLTVGAVLRPGFQCLDWPCLV 
YARLSGWTGFPPCRYGKGCCRYDEGCCR 
FGEGCCRCDDRCCRCGEGCCRCDDGCCR 
YDEG* CRCDDGCCHYGE 
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peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine t L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glu taurine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 


13194 


27095 


A 


13336 


298 


349 


KTTPFYGLRGSKIGTL*CSGVI I AH* SL 
KLLGS SDPLSLVAQAARTTGACHHDQLT 
NAFFFFPLERVPTPLKNNPFLWVKGVKW 
WDLD 


13195 


27096 


A 


13337 


865 


518 


KSATMRMVEDTHKMPAWGGYSGSLQQER 
GELSSLPCP*PESPPPLLIFLKLPSNPL 
PCSPLPCTQGPGQPAPYVSIHTSLKFPN 
AKLSVCLSRAIEFS 


13196 


27097 


A 


13338 


154 


354 


FNLQVKPDGYFVLFICHILPTDSKSGKA 
PGAPCHSRNFHIRKTLGWVQWLTPVILA 
LWET* FSHQENSGLGAVAHTCNPSTLGN 
QGGWIA 


13197 


27098 


A 


13339 


262 


11 


QFHFSRKLTP * KM * LSAHS SNTHEAYVA 
FPKAHKPGLNS KTAF * LGTE AHACNPN I 
IjGGQGGQ ITRGQAFKTSLANMAKPRLY 


13198 


27099 


A 


13340 


358 


164 


TKKGKPRFFLKNSKISRGWWWAPVISGP 
WEGEGGAFV*TGRPKIWLTEVGPLPFNL 
GKKKENPF 


13199 


27100 


A 


13341 


147 


2 


FRP Q KVS VRVGVIiAHACN P T I LGAQ GGH 
1 1 * GQE F * ASMANM VKPSS C 


13200 


27101 


A 


13342 


367 


17 


TSRS WNL FHVL VRF P TAD KD I RE I GS FT 
KKRSLIDLTIPYAWASFTLMMEGKKEQV 
TSS INGSRQKVRSQSGELLSLTPSDFSW 
AHWLMPVFPALW*SETGSLFEVRPSRPA 
WPTW 


13201 


27102 


A 


13343 


166 


492 


EGTQETLCGCI ICLVRGDALNLFHLKCS 
WVGWRGAI CCMELRETAKQKLS VWKLYP 
FEIVFSFSNNNPRPGAVAHICNPMTLGG 
*GGRIARAQEFETNLCNIVRPHLFR 


13202 


27103 


A 


13344 


804 


1124 


TFFFCFFL*DRVWDVAPGWESSDMIMGP 
LQALTSWGSTDPPTLGLQSGLGDLQDTW 
P P YPTS F YYFFLQGRGLTMLPRLVLNS C 
TQAILPPQPGQRSKTPISLKTN 


13203 


27104 


A 


13345 


280 


448 


GGGGKKKKTPKPPLEKNNFSPPP*FFPP 
KKQIKPPPPF*GGGGKKKKTPKPPLEKN 
NFSPPPFKPRKGIFFLFPPLSWVKKKGD 
PPGGTRPLAPL 


13204 


27105 


A 


13346 


232 


400 


AELLRYCSCGHFL*SMARYKTKPGVRWL 
TPVIPVFWEAEAR*LHEPRSSRPASATQ 


13205 


27106 


A 


13347 


138 


332 


RAWKPSFEIFYFFF*RAHTF**SCVCL 
C VC VCVC VC V * THTHTHTHTH KHTHD Y 


13206 


27107 


A 


13348 


314 


395 


WLGIVAHAYNPSTLGT*GGWIT*GPEV 


13207 


27108 


A 


13349 


388 


371 


I MNGWQD KE LVS RV I Q TG I KK * KPRNR P 
E FRAPP *WHERNGRKERERMEGRKERER 
KKGKGKERKRKKE AC I YKVKI E I SNNLA 
NLIINVER*DRRRPDILSWVSLSSRVSS 
VFIALVTCLTSPQFFLSLHLNQILFPLG 
KNL 


13208 


27109 


A 


13350 


412 


40 


LVFRFWMCLFSRELFGCFSHLLTKSMSQ 
M* WAFGD IMYTFDLLF I KKTKNNCKL W 
QGCKEEGSLIHCWRECKLVQPL*RPI*R 
SLKKLQMGLPYDPAISLLETYPKERKSV 
Y*RNICTSGRVG 


13209 


27110 


A 


13351 


372 


148 


FFFFFFFFFFFFFFFFFFATGSCRVA*P 
GVKWL I SGTVPLL I STGVLTCS I SDLGQ 
PVHTSLGNDWPYSQELPY 


13210 


27111 


A 


13352 


209 


206 


KKKKKNFFS FHGKNPGNGGP FGGP P PP P 
F*TFFKKGGGLPKGPPPKGFFWNPPQNG 
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E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I-Isoleucine, 
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nucleotide insertion 














GGGP * P GGEKKT P KP S LGVL F F F FF KF F 
KKRPTAFFFLKKKIFF 


13211 


27112 


A 


13353 


43 


334 


KKKKKKKKKKKKKKKKKRGGGPF IKISG 
GAQFFRGEKKFFFFFLGGGKKTPRGFFE 
KKPFFWGGNFWPPPPQKNFPFGEKKKFL 
GGGGGKNRVFFWGEKIFSLGFFLKKFF* 
KKPEGKNFFPPKKNPVFSP PPP * KFFFF 
PKGE I FLGGGGPKI PPPKKRFFFKKPPG 
GFFSPP*KKKKKFFLPPEKLGPPRNFYK 
RPPPPFFFFFFFFFFFFFFFFF 


13212 


27113 


A 


13354 


264 


496 


KPTILRVTHFLEGCEKYEFLLPVAFSNF 
SEPLKRYYYWLGAEAHVCNPSTLEGRGG 
QIT*GQEFETSLVNMVKTCLY 


13213 


27114 


A 


13355 


183 


9 


IRSPKIPEI RVF F F S RWS YAL VAQ VGVQ 
WGNLS*MHPPRARFQPLSSLSVPST*KY 
R 


13214 


27115 , 


A 


13356 


208 


378 


EICLEFKIIDEMDKCGKLWLGTGAHT*N 
P I TLGGQGGQTT * GRAFETS LD IMEKRY 
L 


13215 


27116 


A 


13357 


256 


341 


KGQTQWLTPVIPTLWEA*AGGLLESRSS 


13216 


27117 


A 


13358 


78 


344 


IVGLFESESKKGQHIVSGWIYLRLLIYR 
FLFPLFSCLCKFFFFFFSPETEFCFVPQ 
AGVQ*HNLG*LKPPPPQLKQFSCLTLPS 
SWNYR 


13217 


27118 


A 


13359 


259 


291 


AQ*LMPWPALYEAEAGGSSLEPRSLLS 
VWAT 


13218 


27119 


A 


13360 


448 


121 


RLFDLGNKKTSSFVQNSENAKYEDSLQF 
YCRIYIYVYIYVYICIHICVYM*TYMCM 
YICI YMYTYTHVYTHVYMCIHTYMCI * A 
YICVYIYIF*KACLPLAISTLSSGH 


13219 


27120 


A 


13361 


125 


262 


FSFQAS VEFTSKTVWS WVQWLTP I ISTT 
* E AQ AGGS LEARNS R P D 


13220 


27121 


A 


13362 


362 


46 


ARAKGPKKIGFSGKMGPP*GAPPPKMGK 
KI * ITPPPKFLFFFLGKTKI KNPPWGFW 
P FGFP KKKRGGGARR * KTP S P FRE KP P P 
QKLKRFKTPFPPLFFFKNPRP 


13221 


27122 


A 


13363 


378 


48 


F KKAARE KVS TRKFRG F VCL F WD F FWRQ 
SHSVTQAGVQ*CDLNLHPPGSSDSPAAA 
S Q VAWTTGTHHHTQL I FI F I FCRNKI S P 
SLLKKYKKNLPGWAGACNPSHLGG 


13222 


27123 


A 


13364 


174 


396 


SLIFILEKRECYYPRSLFTVAWFTVAKR 
* QTLNCPS TDKW INKMCHIHTVE Y YSD I 
KRNE I PMRATCRQTLKA 


13223 


27124 


A 


13365 


76 


280 


PLLPFKAWQWVGCHYW*KLD* IMCYFR 
ALLAFSEYMYIYIYIRTHTHT* IYIYIY 
IYKLTYSHLKVG 


13224 


27125 


A 


13366 


133 


324 


YLL * I LVYCSHL I FS LLCLEGI I LWVF I 
IATLITLNTHSLLINIVPIAILDLAAGQ 
AAVGLALLVSKKKKKGPPLKKPPLGAQI 
SPANQRKKFPPKRKPIKTRRGTF 


13225 


27126 


A 


13367 


488 


190 


FSYAACFLPQCVFVCTFIYLHADGFLLL 
IIFFENSVFILCHSCWVFCFLLQWFLLM 
NLFSLFLLMLMHPVS*MECSIYSFSSLI 
IFHFSLMLLFFCFMI 


13226 


27127 


A 


13368 


229 


372 


YYATKDFSRPRAVAHTCNPSTLGGRGRQ 
IT*GRQFETSLANWKPQLY 


13227 


27128 


A 


13369 


122 


2 


KE ECVGSGTWWVMP VI PAL * EAEGRGS L 
EPRSLRPAWTKK 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F==PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


13228 


27129 


A 


13370 


43 


412 


RPTRPKRQNYGDSENISGCQGLGKKGMN 
GKSTEDFEGSETTLYDAITVGTCHNKFI 
RSHIVYNTKSGP*HKLWTWGGHDVSV*V 
PQLRKT YHPGERCL * *GRLSMYRGRE Y I 
GNLCIFCSLLL 


13229 


27130 


A 


13371 


136 


1 


KSPTWPGAMAHTCNLSTLEGQGGWIT * G 
QGFETSLANMVKPSDAW 


13230 


27131 


A 


13372 


228 


385 


GLLIHKLEFKHFWLGTVAHTCNPSTLGG 
QGRH IT* GQGFETS LANMAE PCLY 


13231 


27132 


A 


13373 


277 


429 


LETTTKAGLPTI I FTSSGQMS I WFFQNK 
P WDS KKLFNLLS ISSP**SS KNY * PWQS 
TVAHT CNP S TLGGQGRWI T * GQE FE I S L 
ANMVKP AR VGRH V I RGLQVS 


13232 


27133 


A 


13374 


194 


3 


NQENY I Y I YTHTHT YTHT Y I YI L * LLL Q 
WVYLAYFLCLSYPSLFAQGFTHTTYTHT 
HMHTLIL 


13233 


27134 


A 


13375 


377 


228 


DRVLLLLPRLECRGI IMAHCRLPRLVSN 
S*APALSFQSAESTGVPNVPS 


13234 


27135 


A 


13376 


255 


465 


NAWKC P F I YK I T F V I FL IHVS C KKFRN Y 
RQRE WKLPTVRPLS PS *AI I FP VTCTYT 
SRWPEATKDPQKK 


13235 


27136 


A 


13377 


336 


38 


VWWCTPWPATQEAEVGGSLESGRLRLQ 
*AVITIiVNEHRESAIiASRRGPEETSSVK 
PPLPTILAHTCFSLPRTGQDITSRFLAQ 
RNTEENLEDQMEARA 


13236 


27137 


A 


13378 


376 


293 


FFFFFFFFYYFFFFK*KFLTKKKILiSSQ 
YI 


13237 


27138 


A 


13379 


215 


16 


HLTWS FTTATEGSKTVTQHS VYRKAKLG 
LGAVTHTCNP S TLGGLGRW IT* GQE YKG 
IPPHGLEDVQ 


13238 


27139 


A 


13380 


184 


64 


VDE S LEGWMMDEWMSGWRGGC INRYMHA 
WMDG*GDGWIGG 


13239 


27140 


A 


13381 


233 


417 


LKP I TKGRKPRGFFLPFKPKQKKYFWG F 
KKKKKKKKKKKKKKKKKKAS RAP F * KKG 
PQKTP 


13240 


27141 


A 


13382 


34 


360 


R WNT TNADHDL KDN I L S P P Q I NL Y I RQN 
S SRLFCRH * QMDS KS P VKI PAGFVL YI * 
I YRYTHTHTHTHTHTD SEL YME THTRMA 
NTIVRKNNSLEHSYYLMLRMTIKLP 


13241 


27142 


A 


13383 


214 


71 


QTLNTDSGPGWIHACNPSNLGGHGGKI 
I * GQGFE TS LANMVKRCLY 


13242 


27143 


A 


13384 


352 


312 


DKQLTLHRTDSLYI F YTY* PPQSCEPI S 
YNNFPCLSQYLYLCPSIjHTHTHTHTHTH 
THTHTHTHTHTHVS VGRRS L F F SGPHTE 
APRSRVSVYT 


13243 


27144 


A 


13385 


393 


271 


IEGQIQYTSTIGNKFKDFYLILCKEGIM 
S RE I S P P S S CHLRQQ * RVRLRERDRERK 
RQRERQRERE * GRS VLHPHVT 


13244 


27145 


A 


13386 


180 


440 


PVEERTLCEDILCFPFSVLLCIQFHLLI 
QHAC^KYPNPNSRFGSWPGAVAHICNPS 
TLGGGGRW I T *NQE FE ARL SNMVKPRL Y 
KNI 


13245 


27146 


A 


13387 


190 


47 


EGEKGVPSTILKMETLLGTVAYPCHPST 
LGGQGGRI AE AQE F * DHLE 


13246 


27147 


A 


13388 


188 


470 


ARPPCKGRDSSAEGPPGPPFPSWSSLGC 
WTRE PPGRGEP I Q VAVRREE S AQDWARP 
E L 1 1 KE WW P GLVAHT CISTP S TLGGRGGW I 
A*A*EFENSQ - 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GJycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProJine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


13247 


27148 


A 


13389 


454 

* 


367 


NKKKPPPRPP*PPPPPLGKTPFFKKKNP 
NFPRGGGNSPGSPSFGGGKGKDPPPGGF 
FPPPPYPRGEGGNPFFKKKKKKKRASLD 
P * WS SQP CPAYEKE LNFAP VLATVTLP P 
GHSS 


13248 


27149 


A 


13390 


34 


354 


HTLLALLLI I ITF* LPQLNGYIEKSTPY 
ECGFDPIS PARVP FS I KFFLVAI TFLLF 
ELEIALLLPLP*ALQTTNLPLIVMSSLL 
Li 1 1 1 LALS LAYE * LQKGLD * TE Y 


13249 


27150 


A 


13391 


1089 


792 


THGFGPQVGSPLDLKQPQKAMPRGPSLG 
NPPPGCQQHSHGAGGP*VPPGPRQPVMA 
SRNLCLPLALPRTHPPSPGQTRDKPRTQ 
DRSCWRDPRNGPGV 


13250 


27151 


A 


13392 


299 


428 


W I RCRGRGRS CLP VPHSAPALL S P L WD 
GTRRRE AGGGT CQ GGS GCMGAH WGLGGG 
EGSGCRSQALPREAAEARRE FKHGGI PR 
H * LTE S LVSWE FRHA 


13251 


27152 


A 


13393 


382 


413 


HGKTHLYKKFKNKKFKATMPA*WLTPVI 
PTLQEAEAEASLKPKSSRPAWVTWQDPS 
LQKI 


13252 


27153 


A 


13394 


64 


401 


GGIPKEDSQAGAFTGIGERQQGIiVIPLL 
KNGQLS TDACVP P WGRVGVERAGP PNQ P 
AGEGGGQGQECSLGLLPEANRCHPCNPS 
TLGGRSGQIA*GQEFEISLGNTVRSCPY 


13253 


27154 


A 


13395 


146 


18 


FS PGWAHACNP S TLGGQGGQ IT* GQAF 
KTSLANTVKPKRPQ 


13254 


27155 


A 


13396 


1056 


679 


DIQVPERPL*NVPEPEAKGEPPDRAVGE 
H I DRD CR S D P AQ Q KRK I FTNKCE RAG CR 
QREMMKLTCERCSRNFCIKHRHPLDHDC 
S GEGHPTS RAGLAAI SRAQAVAS TS TVP 
SPSQTMPSCTSPSR 


13255 


27156 


A 


13397 


440 


423 


IHSPPTQRRV*QRERERERERERERERH 
ALAERNRTREG I S GTTGERGNLMLVGHH 
LPGPLSSRRLGFCPDGFGGQHCLAQGDQ 
VRL 


13256 


27157 


A 


13398 


2 


256 


IETLGSAVEFIPYENTYQTYLKKKNRNS 
HMQSQPNRKGHIRLKYIAAWA*WLTPVI 
PALWEAEAGRSLESRSRNPVSTKKKIQK 


13257 


27158 


A 


13399 


102 


922 


LSFFLFSETGSCSVAQPEVQWCNHSSLQ 
p * TPGGPSMQFQLPQEQGQRTAVWHGGR 
A 


13258 


27159 


A 


13400 


198 


55 


KDYS AWHTCNP S TLGGGGG WI T * GQEL 
KTSQANMVKPHLYKNPKIEF 


13259 


27160 


A 


13401 


19 


447 


ESALNLPSAGI IGVSHRTQP I FCI LNAL 
ALGVL I LERS PLQ S PQVP PSHSHTKPGS 
LPSVTPGRGPGRPRAAEPTAQGRRYNSN 
PVLISPGSVHPASFALPAEPP*TGAPST 
PRPCVLRPLLLGNAMDLLCTKLSSRLPP 
YRT 


13260 


27161 


A 


13402 


93 


410 


DSNLNYSLFFHGEADLGTNQVLTHPSTT 
AMYFEHYCQP P * I VHGT INT * P PWHKN 
PIHIITPSPCLRASTAINLLLSHINCYS 
*ATPHPLGYQQTYLPLTVHST 


13261 


27162 


A 


13403 


55 


387 


SNSHT YS LKKS AG I TKFQNWH XVNC I CI 
CQVWPLEMVNRHTVI LCQ YPVKPRI L Y 
QHHTAILVTILTFTLRPGWSHACNPST 
LGGQGGRI I*AQEFKISLGNIVRPCLY 


13262 


27163 


A 


13404 


39 


365 


SGDRRVRLLLKI I T F * LS QLNG Y I E KS T 



1350 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KKLysine, L=Leucine, M=IVIethionine, 
N=Asparagine, P-Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














RYE CGFDP I S PARVPFS I KFFLAAI TFL 
LFDLEIALLLPLP*ALQTTNLPLIGMSS 
LLLI I ILALSLAYE*LQKGLD*AE 


13263 


27164 


A 


13405 


32 


350 


HEFATSTSEMTKGTS* IVKRRIKTHTLC 
RRCGS KAYHLQKS TCGKCGYPAKRKRKY 
NWSATAKRRNTTGTGRMRHLKI VYRRFR 
HGFREGTTPKPKRAAVAASSSS 


13264 


27165 


A 


13406 


326 


668 


L.QFKI FCMPGI FT I SE VS SMSN*RTAYG 
SQSFHKIVLI *LATFGQALSC* IL* IHL 
G YL KNKT KHRAL VAHACN PNTLGS PRGR 
IA*GQEFETSLGSIVRLLLSTKNLKKKK 
NM 


13265 


27166 


A 


13407 


170 


472 


KL I S VWS EES S YC VEHGE PWPQEKEYS 
WliEIGKAEMERSTLESLLGATLEALSSN 
PTRKMQGLKLKLQRASAFKEE PWLGWA 
HACMPS TLGG* GGRI A 


13266 


27167 


A 


13408 


178 


422 


FYNRYF*YTKTYQTENIYCKIMSFRLLV 
*KI * SLPGAMAHACNPSTLGGRGGRI I * 
GREFETRNS 


13267 


27168 


A 


13409 


274 


444 


LiFSQYIVFVHYSSFVLILFV* *LFSQYI 
VFVHYSSFVLILFVILHNYTYIYIHTHT 
HTYTL I FLL IFSQI SLGWMKL I L 


13268 


27169 


A 


13410 


205 


23 


KEFLKFHRKCICQQVKKIWPRAVAHACN 
PSTLGGQGKRIT*GQELETSLANVEKPH 
FSNS 


13269 


27170 


A 


13411 


25 


426 


S VWWWSLETRSSRIiRLLKKI *NLQNKKR 
KFEHRHAQRKYDLETQGEDAIYKLRRER 
RKE E E * E KE E KKRRRRKE KE KKKKKPT L 
LCLKKQRNQHLDLRLPASRTVKKLISW 
*ATHSWPCYGSPSKLIQQSIH 


13270 


27171 


A 


13412 


227 


52 


E IGSHS GHPGWSTVAQS * LTLTSNS * TQ 
LILPPQPPRELHYNCVPPCPAFCRQALA 
MF 


13271 


27172 


A 


13413 


345 


67 


SKCC YWAHDC I LLDL I CL YFLELVGC V 
DCFSSSLGHY*PLFV*IFFLSPSLLFWY 
SHYEHICVPHFSKAPFIFLHFFPSPEFP 
FLCIEMVYV 


13272 


27173 


A 


13414 


345 


33 


KVF F F L 1 1 C KMRMQNKQ KNKHG I CLLNS 
AMNTC I S FPLRQQ I MTTWWLE VTQI WGV 
LF YL FHF I EMGS C P GWS AVA * S Q P SAT P 
VSWSQVILLPQPPEDLGNS 


13273 


27174 


A 


13415 


187 


415 


FNQGWAFLFPFCVI FSYP WKDRNRT I FS 
IKITSFQESVLKLFCIFEVS*II)KIGLG 
QVRWLTP VI PALRRPRQADH 


13274 


27175 


A 


13416 


143 


50 


CMKLQS S I VMLGWAHACNP S TLiGGQGG 
WIT*GQGGWIT 


13275 


27176 


A 


13417 


170 


17 


KQMREKEGNGLLTISKVWKQPKSPSQS * 
* IKKM* YICTMECYSALRPEFPE 


13276 


27177 


A 


13418 


7 


163 


IK* IWYI * TMEYYSAMKRKKIMSSAAI W 
MKLEAI ILGEVMQEWKTKNLMFSFISGS 


13277 


27178 


A 


13419 


901 


709 


SGRKTSLGNIARSCLYTKSFKIHCLPVV 
PATQEAEAGGSLKPRSLRLQ*ARSHHCS 
PVWVTTRP 


13278 


27179 


A 


13420 


236 


4 


VIRAKTGSHNSLFPPHTFFFQGKNLCFF 
QKKKKKKIYIQSKPGAMAHACNPSTLGG 
QIT*DQEFETSIiATMVKPNL.Y 


13279 


27180 


A 


13421 


46 


141 


NSVISAHCNLCLPGSSDSPSSASRVAGI 
TGWC*HCNLCLPGSSDSPSSASRVAGIT 
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SEQ ID 
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Amino acid sequence (A=AIanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=PhenyIalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GWC 


13280 


27181 


A 


13422 


604 


1121 


LGSGDLP*EINPLSSCSLLREKDPPTTS 
GPQTDQPKKHLTNFKSAPYKTITDAELR 
VTLTVEAHLHPGE INSHVAHTKP VWWS L 
HTDAHE I WCRDSDLKLVPTLPL I PLEAA 
LRNITHSLSIPPPKNFRRPNTSTLFCVI 
FLINIRRQEYQASEPKPSHRIPCDLHVY 
AQMA 


13281 


27182 


A 


13423 


214 


22 


IRESSSTPSSPTLPESCNSIPFPF*KFI 
WSGWAHAYNLNTLGG*GRKIARAQEFE " 
NSTRAGLW 


13282 


27183 


A 


13424 


397 


330 


SEYNSECS * VQWHTLWSATQEAEAEGS 
LEAKS S RLLCAL 1 1 PVNCHCTPAWAFPM 


13283 


27184 


A 


13425 


729 


885 


AKCYPVRNISLWPGAVAHACNPSTLGGQ 
GGR1T*GQEFRTSLANMVKPHLY 


13284 


27185 


A 


13426 


231 


2 


ASVFFKATLVDLHW*IYIYTYIYTYIHI 
YTHTYTHIYIYIHTYIYLNVNQYICLYV 
HINANQQGYIYTGRTPICFF 


13285 


27186 


A 


13427 


184 


171 


liS * Y I Y I YTHTHTHTHTHT Y I L Y I YT YE 
QPSSQSS 


13286 


27187 


A 


13428 


372 


166 


CKFRPRKINTTFSHI CGS * KVDI IEAA* 
HWLGTVAHACNPSTLGGQGRWTI *VWEF 
KASLANLLTPLLQ 


13287 


27188 


A 


13429 


391 


437 


MYSLGVGFFHSTI * LLKD I FWSGWAHT 
CNPSTL * GQDGR IT * GRKLETRPGNRAK 
LYLYKK3STSN 


13288 


27189 


A 


13430 


39 


254 


EFIPRAQDLiETSQGNRVRPRLYRKFKNlSr 
*VWWCASWPAMWEAEAGGLSEPGRLRI J 
HSNMDNGERSCLKKK 


13289 


27190 


A 


13431 


336 


488 


DIWPGMVAHACNPSTLGGQGGRIA*A*E 
FE T S L S NMARP WL KKKLARH S GA 


13290 


27191 


A 


13432 


154 


481 


FFFFFFFQRQKNGFF*KGIFFFFKLEGR 
GAIFV*PNLTLPG*GKSPASPIi*KGGKK 
GGAPPP*LIFFFLKKKKLPLLGQGGLKL 
RALGKPPPFPSQRGGINGVKTPFGL 


13291 


27192 


A 


13433 


388 


280 


KS KL KNGC C VAL CLLGL I * S VL LA* F KT 
*LFCVSLDMYIDTTCSLSISIYLSIYLS 
IYLSIYLSIV*SVS 


13292 


27193 


A 


13434 


187 


48 


S I SS TKGPGAVAHAYNPNTWGGGGGR I T 
*GQEFETSLANMVKPRNS 


13293 


27194 


A 


13435 


71 


309 


DSVSEEEEEEEVELAHPLAAERYHCERA 
ET*VKAIiLWLELCDERVSSRSHTEEDCT 
EELFDFIiHARDHCVDHKLFSNLK 


13294 


27195 


A 


13436 


18 


375 


RPAVPGRPTRP INTLLGLLL I I I TF*LP 
QLSGYIEKSTPYECGFDPISPARVPFSI 
KFCLLAMTFLLFDIjE ialllplp*alqt 
TNLPLIVMASLLLI I ILALSLAYE*LQK 
GLiD*AE 


13295 


27196 


A 


13437 


373 


27 


THPYYSHQEYQSP* PLTGALSALLMTSG 
LAM*FHFHSITLLILGLLTNTLTIYQ*W 
RDVTRESTYQGHHTPPVQKGLRYGI ILF 
ITSEVFFFAGFF*AFDHSSLAPTPQLGG 
H WP P TG I T P LNP LE VP LLNTS VLLAS GV 
SIT*AHDP 


13296 


27197 


A 


13438 


575 


699 


LGT VAHT CNS S S LGGRGGW I T * GQE FE T 
SLANMVKTCPQKK 


13297 


27198 


A 


13439 


247 


168 


LENLIYTRVLERHREAKVHFPFSMISYS 
DKRTDTFVLTKTH* HTHTHTHTHTHTHT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HTRGSHFFY* HTHTHTHTHTHTHTHTRG 
SHFFY 


13298 


27199 


A 


13440 


391 


168 


T KKKKHP S PRGARE VFF * Q P P P P P FC VL 
* FFIVFFLSQGGGGGGGEFPPFFFPPPT 
SPPSPP PRGVWKKT P P FS 


13299 


27200 


A 


13441 


403 


246 


LFPTAI IGGPPGFPPPPF*NPAPEFFFW 
GPKKKKKLSPPPAIKMVFFKGPPP 


13300 


27201 


A 


13442 


305 


414 


KNNFHPSILGG*GGWIT*GQALETTSLG 
NMVKPHIY 


13301 


27202 


A 


13443 


269 


409 


PGTVAHAYNLSTLEG*GTMITQGQEF*I 
S LVNT VI PHL> YQKKKKKGR 


13302 


27203 


A 


13444 


295 


37 


DPSYLAWHMVTPHQVSTQHTLPRPRPCL 
PEMNFLVSLKIWQCWPGTVAHACNPSTL 
RGRGGW I T * GQE FE T S LiANMT KTRTRGS 
AR 


13303 


27204 


A 


13445 


256 


388 


FCGFWPGAEAHACCPCTLGGRGGWIT*G 
QKFRTS LAHMVKPRVC 


13304 


27205 


A 


13446 


277 


374 


SRLTATS SRGGW I T * GQEFVTTLANMVK 
PLLY 


13305 


27206 


A 


13447 


14 


387 


PQVIHSPWPPKVLGLQA*TTSQAWWLTP 
VI PALWEAKANGSLEVRSS 


13306 


27207 


A 


13448 


185 


379 


HFGR P RP ADHLR P GATNHPG P Q GE T P S L 
L * NTKKKKRGRP FKKKKLQS LVWQNKI F 
FIiAELKLLW 


13307 


27208 


A 


13449 


168 


346 


ISILRTNNMISIKINLRLFIDELKKRDP 
FIFPYFFFFFFFFFFGGNHSL* SLIVIF 
LIC 


13308 


27209 


A 


13450 


57 


176 


VAESRPGPGGTATELVPPSTRLLTRAPR 
DLLTGKRKP P PLGRMS RARVQ WHNHF * T 
ESCSVAQARVQQRNLGSL*TLPPKIKQF 
SCLSLLSSWEDCLSQRVRDCSEK 


13309 


27210 


A 


13451 


411 


241 


PPLFFFFFFKQKFPSVP*GGGQGWDFGS 
LQPPPPRVKQIFCPKIPPPWPPKKGGVP 
G 


13310 


27211 


A 


13452 


307 


298 


PTHS I VRNNKSL * IN* TSSMCSGLKVMR 
R* SQKQEDP* SGWAHACNLSTLGGRGG 
WIT*GQEFSTFLCPSA 


13311 


27212 


A 


13454 


191 


108 


GWSQTPGLGRSSLLSLPQWWDYRMQCRG 
V I LAPHCGLD LLGS GD P PAS ASQ VAGTA 
AFENCATSFWLP* FLNTQCFKI YKTKN 


13312 


27213 


A 


13455 


242 


400 


IATLLDYLQIPNTGPGAVAHACNPSTLG 
GQGRWI I *GQEFENSLANTVKSCL 


13313 


27214 


A 


13456 


1 


346 


YVTTARCSCWAGTESHMGAKDFYRQEKH 
PFHADSGIGWA*DRLRQRYGRFGRRM* F 
SWARGAGEKHSLA*AQWLMPWTTFWEV 
KEGRS LE PRS LRPAWAT WRNVS TKI KKK 
KKG 


13314 


27215 


A 


13457 


166 


289 


VRSSKLNTWVRWLVPI ILTLWEA* AGGF 
LE PR I S RLVWATQ 


13315 


27216 


A 


13458 


220 


76 


TAS L KL S KS WLD WAHACNP S TLGGPGR 
WIT*GQEIKTSLANMVKPSP 


13316 


27217 


A 


13459 


219 


46 


SPPLKKKNNFFPPGVMGAPPRFFLKGPP 
QNFFFFFFFFFFFFFFFFFLKQ*EQTFL 
LIY 


13317 


27218 


A 


13460 


126 


1 


WSLGAVARS CS P S TLGVQGG * I T * S QE 
FETSLANMVRTLTS 


13318 


27219 


A 


13461 


330 


420 


NICSWAQWLTPVTPAL*EAEAGGSPEVT 
SS 
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nucleotide 
sequence 
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NO: of 
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sequence 


M 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla1anine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13319 


27220 


A 


13462 


577 


766 


AQWHTS I VIiAA*EAAAAGLLEPRS * SRL 
KCTMI I P INSHCTPAWQHMRPCLLKKNF 
SNSFISK 


13320 


27221 


A 


13463 


206 


387 


VTVGETEAEAGRGHGLLTCCVLKTQKWL 
GTVAHAYNPSTLGG*GGRIT*DQEFETS 
FLGS 


13321 


27222 


A 


13464 


167 


41 


DFWLGAVTHACNPNSVGG*GGQIT*GQE 
FETSLANMVKPWNS 


13322 


27223 


A 


13465 


480 


79 


SRVISPIiLILNSVISAKSLDPVGSQGKS 
KGQVWSTQEERLEPGVMGEFTPGPCPCL 
VSQGPSLCPSPPPYSFRWSRLSQAPGSI 
RVLI *RPAAATSAPPPAQLGLPDTAFPP 
PLIiSRDGQWAAGRQGDRDKGLP 


13323 


27224 


A 


13466 


3 


373 


D AWVAHAS AHAS VCL 1 1 DTLLALLL III 
TF*LPQLNGYIEESTPYECGLEPISPAR 
VPLYIKFFLVAITFLLLDLERALLLPLP 
*ALLTTNLPLIVMSSLLLIIILALSLGY 
E*LRKGLD*AE 


13324 


27225 


A 


13467 


214 


55 


TRQKIFNLFSS*KKFLIPPARAKDFLFF 
IFFFFFFFFFFFFFFFSCSRILLHR 


13325 


27226 


A 


13468 


111 


358 


VMKVFYI * IGELVTQAYI FVKSHQ I AH I 
LFHSMYVLP* IKMFWPGTVAYTYNPSTL 
GGHSRR I S * AQEFETSLGMI VS PHLY 


13326 


27227 


A 


13469 


199 


607 


R I RGT S Q KLL LLKKL I S L I P KS Q P TRGT 
P*TTFPPPNTTNFPPPPVAATAPDPSPA 
HFVSSPYNPDLGSP*PECPSPGRLQREI 
EQCKKDIQNFPFPTTSRICSNDLSLKGS 
AS RRRGHL F C E RP VNQ FRS PKP KK 


13327 


27228 


A 


13470 


128 


256 


EDWMWWLTPVI PTPQEVEVRGSLE PRSS 
ELQ*AVIVPCTPSE 


13328 


27229 


A 


13471 


94 


388 


KGEGKTEELWSWRSHRGTHLCLLiADISA 
LPLHYYYYYYYFWQRWSLPDQPGQHGET 
PCLPKIQI INRAYWHVPWPTDQEADAR 
* LPEPNRQRSQRPE 


13329 


27230 


A 


13472 


359 


73 


ATEPGQLFYF*ESESHNVTQAGVQWLDH 
GSLQPPTP GL KPS F P L I HTG I TAAS H YT 
WLNQLLPMNVSLLTFYESPQLKIQRSHI 
LCNSIHIKFLE 


13330 


27231 


A 


13473 


332 


10 


I LS KRGFPKPGRYCE VNPFLCVQGAK* V 
RP * EDTS LHCE I YHTHTHTHTHTHTHTN 
LKP*TSKELIPQKTESLFFSLCIKKSLL 
RFQTFPDAGSTGNSVGCACFISK 


13331 


27232 


A 


13474 


355 


385 


IMVCIKQSCVH*KNN*FWPGTVARVCNP 
RTLGGQDRQTA*AQELKTSLDNMAKPCL 
YKKFF 


13332 


27233 


A 


13475 


297 


425 


RKSAWWLMPWSPL*EAEAGGFLEPGSL 
RSAWATWGDPHLYKK 


13333 


27234 


A 


13476 


160 


1 


KS TGRLGM VAH I C I P S TLGS QGGWI A* A 
QEFETSLGKILSLPEIQKSASLITW 


13334 


27235 


A 


13477 


334 


79 


GMKEQ INFL YQKRG I KPNKLQAENWL I 
HYN* KLLSNS FK*LQDTAS I * YFFGNYL 
NRPGMVAHACMPS TLGGRGGWI T * GQEF 
Ii 


13335 


27236 


A 


13478 


234 


385 


YAKFHVS I NE KKP D VAHACN PGTLGGQG 
KWIT*GQEFKTSLANMTKPRIY 


13336 


27237 


A 


13479 


408 


129 


KRS INRKKGGELS S SHFLLLLLP PLDEE 
PSPPPFCPPLPPFPPPCPRLPQSG*GAC 
GRLHPC * MDLGPCKCPARKVFSHLSCSL 
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M 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~ Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 














VLQVCRSP 


13337 


27238 


A 


13480 


251 


384 


TITPWRGAVAHVCNPSTLGGQSRQMT*G 
QEFKTSLSNMVKPRLY 


13338 


27239 


A 


13481 


164 


300 


AGDD S DNS MGL LGR I HE I KF F KCLG P CM 
TTRFIVPALWEAEVGGWLEPTSLRLR*A 
LIALLQCTPA*GTVYDYKVYRPSTLGSR 
GRRMA 


13339 


27240 


A 


13482 


2 


30 


TGCHQFRFLNNPFS PHPVLQLVSPGPDC 
LI FR YFKF * LHQLM * INPNANF CGARS F 
LCKFRWIS*LRYPSLVALSVLFVFK*S 
VQI * VKKKKKKKKKKKKKKKKKKK* K 


13340 


27241 


A 


13483 


39 


199 


LDVGT CS P I VPAS QEAE AGRLHEPTS LR 
LW*AMIVQLYYSLGDRARPCLYLKK 


13341 


27242 


A 


13484 


18 


72 


TRTRSYPGSTGCQLHTFGLVSLYNCVSQ 
FLITNLYISIYRDIY*YIYIYLYREIYL 
YIYIEIYTYIERYVYIHI*ILGLQVANC 
IPLD 


13342 


27243 


A 


13485 


266 


496 


LDSASDI TQAGVQWCNHS FL * PQTPGLK 
*ASTSAS * VAEITGLSQHVWVKI IMSVS 
LiSSYTFIQLPSILLYGNTAP 


13343 


27244 


A 


13486 


286 


72 


HILSHCTVLWFLTPHHGTTHYKNTTGQP 
SIVAHTHNLSTLGG*GRWTT*GQEFKTS 
LANT VKCS DAWVDRD 


13344 


27245 


A 


13487 


309 


6 


RFFFFFPPGKKGVFFQRFFFGFPRGFPP 
PRFFKTGPEIFFFGPLKKKKNFPPPGGK 
IVFF*RGAPPFFFFFFFFLFFFFFFFLK 
KNYLIiLEKVQS PEKEG 


13345 


27246 


A 


13488 


183 


459 


RRVKKI CKCII WQNNYL * KGINT CSMPQ 
KKKKKKKKKKKKKGAPFKKKKKKKIS *G 
GGGPFF*KTKIKPPGGRGFFFFFGEKIM 
LFPAVFI 


13346 


27247 


A 


13489 


348 


1 


CEGKDGL P I Q FAS WFKYAGFHS LES I FQ 
SFCQTKKLEAPKELSPCSQLYRYNWQLT 
CRRMKLDPHLLPYTKINSRPGAAAHA* N 
PSTLGG*GGWII*GQEFETSLTNMGETP 
PTRP 


13347 


27248 


A 


13490 


313 


320 


KQGGPPLAQARVQWYNLSSGETSLPLPG 
SSNPPPSATGVAGTTGSPPLA*LIFFFL 
* DRENGDQRKL 


13348 


27249 


A 


13491 


187 


350 


GNPVKK* IKNTTSSSGWVWWLMPIIPAL 
WDAEVGGLLEPRSLRPAWATERDSVS 


13349 


27250 


A 


13492 


211 


83 


E YAKKNGKLLS WRKYL* S VC VC VCVTTC 
VCVCVCVTTCIKNI 


13350 


27251 


A 


13493 


286 


407 


LGVEAYACNLNTLGG * GGR IT* GQE FET 
GLGNTARPCLYF 


13351 


27252 


A 


13494 


897 


1033 


KFIFKQMQWDDHSSLQP*TPELKQSSCL 
RLP S T WDYRRE PLHLAM 


13352 


27253 


A 


13495 


265 


3 


HLILLEEDLTWQRGKWLKGRVSLCSLGS 
* TQ S LT VL KKGRGLG WAVAP S TLGGRG 
RW VALAQEIiETS LGNMVKLS LS KI QKLA 
GMV 


13353 


27254 


A 


13496 


319 


1 


C FKLWDT CAQ CAGLLHR YT C VM WC YTH 
HPEFCYNIMGLPKYMWSYLWPKCCHAVY 
HFRYI KNLLPGCVAHACNPSTLGGRGGW 
I T * GKE FKRRMKTKV* IG I YF I 


13354 


27255 


A 


13497 


265 


11 


VLGE VSNQHLLGAP SNRTP SEVPLAQAA 
QGRPSLVPPSAIVFPSYFTICYAFLSIS 
MP 1 1 P * VWE AE VGELLEPRSLRQA* ATW 
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nucleotide 
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Predict- 
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nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, \= possible 
nucleotide insertion 


13355 


27256 


A 


13498 


164 


22 


GIKRKLIKSGWAHARNSSSIiAGRGGWIi 
T*GQEFETSLAKMME I PSV 


13356 


27257 


A 


13499 


277 


363 


TYLCYI FNI CLFVGE I EKQR * LI * FHRS 
TLNGHILYVRPVSPNPSQAVASPVTCTY 
MPTWPEWEESQKK 


13357 


27258 


A 


13500 


1 


270 


YTCSVLLLVTVNLiLiLCLIYKLNFIIGML 
RKKT VYI Y I ERI Q YYL KFQAS TGCLGVY 
PT V* P YMYT YT YM YM YMY IYIYIYIYID 
CLCLDG 


13358 


27259 


A 


13501 


170 


383 


I LYS S KLTYPS PQ I VI FFCGKST * GWVW 
WLMPVI PAI * E AS VGGS L EST S LGQAWA 
TGRKAFFVFYKEYKN 


13359 


27260 


A 


13502 


31 


402 


GD YLYLREGWQE PARREQHAQNLEPNRT 
FTPHL I CLTNYRLFSIiAWSECEERRNET 
TDKNQTVKDLALDMKRVLPFS *GHQKVI 
KVG WAYACNP S TLGGQGQW I TRGE E FK 
TSLINMAKPYLY 


13360 


27261 


A 


13503 


232 


446 


RNRPGAVAHACNPSTFRGRGGRIMRLGD 
*RS 


13361 


27262 


A 


13504 


196 


390 


VNKAE KKHL YF VARKE I LRPGVWHTCN 
PSTLGGRGGWI I * GQEFETSLPLPKTQK 
LAECGRGCL 


13362 


27263 


A 


13505 


146 


7 


LKYVLYWLGAMAYTCNTSTLGGQGGWIS 
* GQE FE TSBANMVKP CLF 


13363 


27264 


A 


13506 


177 


18 


SQNFGRPRQQDHFRLDGRHSETPSLQEK 
VKKIiTRHGGACVQ S QLLERLRREDHLGP 
GGGGCSVP*SVNII 


13364 


27265 


A 


13507 


117 


338 


NKI LKKKKGGGR F KE S KF T S P GLQGNS F 
FMGPPKLNSRAGV* QRREGKNLGVPQLK 
PFEANPLFARGPNTKNP 


13365 


27266 


A 


13508 


306 


443 


LARYEPAVRTRAC * AQWLTAVI PVL * EA 
EAGGPLEARNLRPAWAT 


13366 


27267 


A 


13509 


146 


390 


KMFKGHEQAAHRKKKKRGGRFKGSKFTS 
ACLQRNIFFLGPPKLNSRAGV*QRGDWK 
NPG VTQ FNR F E ENP L FARGPNTQKP 


13367 


27268 


A 


13510 


119 


356 


NEDRNLRGGCPGR*IjLRTEGVCSNPAGW 
SGIRWCEESGGLFWRWRGTESVLFSGL 
FVCC VFAQE KA.KATGRAE VS L CP 


13368 


27269 


A 


13511 


221 


21 


EDLQRDKPLGS C YS TCGWAEQWYLQHPG 
GAGSKCRIWLGAVAHAYKPSTLGGQGRW 
IT*GQEFLSL 


13369 


27270 


A 


13512 


163 


418 


TFPDDQCLMLQDHACVKRS IQSA* YLTP 
AVLAHWEAEAGGLLEPTSLRPAWAT 


13370 


27271 


A 


13513 


239 


21 


FGNLGGPGGRTACIQFSLGNIVRPYFYK 
TKP KP KPNQNKTKI S *AWWCMP I VPATR 
KAEVGESLEPRRSRLQ 


13371 


27272 


A 


13514 


135 


11 


KF F F W PG VXAHAYNPS S LGAQGGR I T * G 
QEYETSLANMVKS 


13372 


27273 


A 


13515 


162 


57 


EGTLRSRRPPLGGWVT*GQEFETSPANM 
VKRCLY 


13373 


27274 


A 


13516 


424 


54 


PKRGFFPTPFIWVPPVFPLPPFFKPPPR 
IFFLGPPKKKFFSPPPGLKIFFF*KGPP 
PFFFFFFFFLFFFFKKKKKRLGGFFFFF 
FWARPFSFLSLFFFKETIPLYYNIWMHK 
DSCKAVIHHYH 


13374 


27275 


A 


13517 


310 


349 


GGGPLKKKLFFSRGGERFFFF*GAPPFF 
FFFFL^SSKIFIFFIiKSFFFFFFFFFFC 
FFFFFSQILFFISSPCFVFFFLNYTSRT 
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M 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine> 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














CSYVCALILFKGNTYSRKARHILW 


13375 


27276 


A 


13518 


273 


393 


IKCLSYRKKATTLGGQGRWIT*SQEFET 
S IjSNM * KPQL Y 


13376 


27277 


A 


13519 


450 


139 


I PP IALVS P FAKHTWQQQMGLVRRSS KG 
NNMSKGWEI IHPSRI FCSNHTLLRIALW 
LGAVVHTCNPS TLVGRGGW IT* GWE FKP 
SKTSMERFCILSRHKRTRGS 


13377 


27278 


A 


13520 


47 


427 


KMKQLFEMLVFQNWTI FLSHS I FRTIR* 
LTLE PLMFRYYFKQ * RTLL P PKF * DRVW 
LCHAGGCL*CSGTVSAHCNLRLLGSSDP 
STSAS * VAGT VGAGHH I WL I LNNREL P K 
QKFFINSSRNLFFT 


13378 


27279 


A 


13521 


199 


98 


SLPSVKLSICCEFPEDIVSKDPIPWLP 
WKRE S C S LCAL PNLLPK * E KTN I QVP * 
VFQSTI KAKAQLGMVAHACNPSTLGGRG 
GWIT*GLR 


13379 


27280 


A 


13522 


314 


18 


QTLNFYILKIYNIISLVHC* IFEARFQI 
LV*V* *NYPF*SLICFTFSWHETQNT* * 
KNLI IWAGAVAHTCNCNTLGDQGRRIAL 
AQEIEPSLGNIAIQ 


13380 


27281 


A 


13523 


18 


231 


CPPAVFGTS IEQ IQLKYD I KD * KIGQAW 
WCAPVIPGTQAEVGGSLEIRSSRL*CAM 
S TSMNRHC VPAWAT 


13381 


27282 


A 


13524 


239 


465 


RVQVSECLLHREKCLYFQLGLS *LIPQT 
GWLKQQKLPGTVAHTCNPGTLRGRGKRI 
T*GQEFETSLANIVKPRLY 


13382 


27283 


A 


13525 


388 


468 


SLRLGT VAHTCNPS TLGG * GGW I TRSG 


13383 


27284 


A 


13526 


342 


369 


CTLNTLLL YS I SDFGVS AFLATGGD ITR 
NKVRKT * LRLGT VAHT YNPNTLGGQGRR 
LT*AQEFKNSLSNIGRPHLYKTKKTKKT 


13384 


27285 


A 


13527 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNSH I YTHM Y I HTHTHTH I C I YKL 
EEKYKEL 


13385 


27286 


A 


13528 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
I YVHTNSH I YTHMY I HTHTHTH I C I YKL 
EEKYKEL 


13386 


27287 


A 


13529 


34 


299 


SAPASASPWGIGVLYRPLACPEDRFSPG 
PEANCGEIETTELRVTLSPRLECSGTIT 
APCSLDLQGSRDPPPSAS*VSETTGASH 
HAQL 


13387 


27288 


A 


13530 


158 


45 


SGHRSRARWL * P VI S ALWEAS AGGS PE I 
KSLRNRRPG 


13388 


27289 


A 


13531 


342 


1 


C F FL KGAP P FF F F Y F FNFF F F F FLVGTD 
KLILKVTWKWKGPKIDKAVPKKNNQARG 
LP * PD I KI F YKVWLGAVANAYNP SNLGG 
*GGWIA*E*EFETSLANMTKPQYKKYKK 
LA 


13389 


27290 


A 


13532 


589 


742 


RIMKMLRIKICGTGPGMVAHAVNPSSLG 
DGWIT * DQEFKAS LANMIKPHLY 


13390 


27291 


A 


13533 


74 


446 


HVGIPSRLTSVSHPRHCKHPRTESEHVY 
CAI EARH YQ CKVS R P S DAAG S E GRPRDM 
TQDTE VADTPPPNI SSI TATRTMAT IGV 
GWAHAYNPNTLGGQGRRIA*AQEFKTS 
LGNLAKP CLYKH 


13391 


27292 


A 


13534 


1941 


1040 


AFHLLPYLSSGFYCSLGPCFGLGSSHMA 
AWHKEPLHNACSDSRRSAPIRQGLGSPS 
ATHDTHTRTLAHI CKE I FKERLHE I KER 
EIDS*RG*NTQR*VEREEAENKYKPTEI 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y— Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














NTEIENHKQQEMLRHAGEERENSPREAE 
RKCRNE KCE I KSRNSRKEARLRKKVKFK 
TNE KGKG * Q 1 1 T VH KE KRE VGRE RRGRQ 
NKKRT KS TQHVTQ KGDMR * S VKE E PHLR 
RAKRNS NKMKNERY VLRQDTDHS CEQKE 
KMRQRHTRKKLARETWKWVRHTLNREQR 
AKPRKKETKEIHIHSAILKC 


13392 


27293 


A 


13535 


1192 


1476 


EIKRKWGPGASCHACNPITGRPRRVDHL 
RSGVQGQPGQRGETPSLLKNIQISWAWW 
CAPVI * E I KRKWGPGASCHACNP ITGRP 
RRVDHLRSGVQGQ PGQRGE TPS LLKN I Q 
I S WAWW CAPVI P AT W E AE AGG S LE TG S G 
GCTEPRSHHCTQAWVTE 


13393 


27294 


A 


13536 


55 


132 


RAES CS LAEDGVQ * GNLSSLQ P P P PK 


13394 


27295 


A 


13537 


7 . 


325 


S P VE FRLGE PTKGT S S FGKRRNKTHTL C 
RRCGS KAYHLQKS T CGK* GYP AKRKRKY 
NWS AKAKRRNT TGTGRMRHL K I VYRT FK 
HGFREGTTP KPKRAAVAASS S S 


13395 


27296 


A 


13538 


259 


274 


KFEAAVCYHCTTLSL**CHTVCFLSFFL 
LE IGYHYI TQAQSGAM IAHCSLKLLGS S 
NP PVP AF * VAETRS AC YNAWL VF VKFLM 
GL*ELWKDMS 


13396 


27297 


A 


13539 


3 


68 


QASVLPQKQLMSWQKLSESTFVKFWNLV 
KTGRLGEEISCCLWREYYSISDCLATI 
KL» PAS HL S MRKP RHKD FKST FL I S T KNQ 
GNHGHGT KGNEVNN FQQRISV* KTYFKK 
YNMRPGAVAHTCNP S TLGG * DWW I T*GK 
NCQNQLS 


13397 


27298 


A 


13540 


268 


468 


YRRLISKKFFKHRWKSVSFFKFSFFWLG 
EVAHACNPSTLGGQGGKIA*AQEFLTSL 
GNIVGPCLYKK 


13398 


27299 


A 


13541 


308 


437 


FSWAFEIGSQAEVQWHDLHSL*PLPPRF 
KRFSHLSLPSSWDYR 


13399 


27300 


A 


13542 


7 


244 


AKTAPLFF* FETVSCFVAQARGQWHDPG 
SLQPRPPGQETSMIKTSSDPPPPASQVA 
GATGMGHHAQKI * FLVETGS PHVSQGGL 
ALLK 


13400 


27301 


A 


13543 


50 


175 


ALPAIHTAVGQCSVEGFCVLSDPEGDGA 
ARMKLVRFLMKLSHETVTIELKNGTQVH 
GTITGVDVSMNTHLKAVKMTLKNREPVQ 
LETLSILGNNIRYFILPDSLPLDTLLAD 
AE PNVKSMNREAVAG * R * G * ARE I FDE I 
ES 


13401 


27302 


A 


13544 


47 


362 


VAFQGLQVPATNPANFFFFWKGGFFFVP 
QGGRQGEKIGLRDPWPPGVRKI PGPTLS 
GTREKGAP PPPPIYF* FF WKKGGQKGGP 
GGF*TWDPKGPPSPTPPKGGD 


13402 


27303 


A 


13545 


97 


351 


NKKKKG P P P F F * KKR VG KKKKKGARAGG 
P P P KP P P FGGPRGGG PRGQNS KT PR P KG 
GNPPFKKKKKKRNI F PAG P MGLKGGGHL 


13403 


27304 


A 


13546 


214 


32 


NYIKISSVDGGAKIK*WT*NLKCLVLSV 
KNITRWVDCLSSGVQDQPGQHGETPSLL 
KTRKTSWA* WRLGL I QRWRKMPWPSR 


13404 


27305 


A 


13547 


350 


159 


QTGTLSQKTKQKKQAQWLMTVILGL*EA 
KAGGLLE P KS LRPAWAT WQDP I YKKKFF 
LERKTWR 


13405 


27306 


A 


13548 


367 


149 


FFFFFFFF* TGSH YVAQNW TQ I P GLNLiP 
ASASQVTATTGMHHHTRHTPFPSLTSHT 
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nucleotide deletion, \=possible 
nucleotide insertion 














QSVSTASWTYLQNSGRH 


13406 


27307 


A 


13549 


321 


2 


GLVKVTHHGYDRNLGYPDSQAS VLSS FY 
TA*MVQ*LQLPDKVQIYPAHPLFINDMP 
P*WLLKIPSFGPGTVAHPYNPNTLGGRG 
RRTTRDREFETSLANVVKPHLH ' 


13407 


27308 


A 


13550 


216 


13 


GYSGKETVFSMRKGQQSGPCLQGPS I PM 
GKAGINQI I P KTTLGGRGGW I A* GQE FK 
TSLANTVKPCLY 


13408 


27309 


A 


13551 


199 


391 


KDFSIFIACFKRPFITGSQGQEQWLIFP 
HAYNPSILGGRGRQIT*GQEFLTPARLG 
NMVKPCLY 


13409 


27310 


A 


13552 


70 


239 


ATCVKN*RTSWAWWCVPWPATHEAAAG 
GLLEPRSSRLQCSVIASVERHRTSVWTT 


13410 


27311 


A 


13553 


105 


257 


GQAQWLMPVI PALWEAE AGGS FAPRS LR 
IiA*ATVPGPNNFLKRAFHSLLNLFLP 


13411 


27312 


A 


13554 


499 


165 


Y*MESHFVTQGGLQWCSLGSLQAPPPEV 
GGWLELGRQRLPMKPKI AP I WTPSPG 


13412 


27313 


A 


13555 


415 


719 


YTDKSIRSISLLPVKGSTHSMCPVKFIQ 
LQEAGG I HMAKLSGQRVDREWRLGT VAH 
TCNRSHLEGKGGQIT*AQVFKTSLGNMA 
KSCLCKKYQKLSWIWQH 


13413 


27314 


A 


13556 


224 


523 


DKVSLCHPGWECGWVQSQLTTASISQG* 
TILFPQSPE 


13414 


27315 


A 


13558 


355 


1 


Q VS LQL P YC VL FQ FH I YNKQAALLQRC Y 
VS ATOLL INAI WQ YVNS LKMCE ENQNVS 
LQNM P F * HRNYFGE AGHGGS CL * SQLLG 
RLRH *NCLNLGGGGCSE LRLRHCTPAWA 
TKASPS 


13415 


27316 


A 


13559 


73 


259 


KY VPHKGKI S ERRL CSVAQAGVQ WCDHG 
SRSLEVLGSNDLLASAS * VAGITGCWGS 
SDPHTT 


13416 


27317 


A 


13560 


205 


21 


WK I S EDFLLGDAKKWAYQED I KRLRLG 
RGAVSHTCNPSTWRGRGGWIT*DQEFET 
SLPKC 


13417 


27318 


A 


13561 


254 


481 


LMALL PGS SD VDLS E YGWEHFNF VIL I S 
KEIFILLFF* DGVSHLLPRLE CSGM I S A 
YCHLCLQGS SDRRDFLKKQ 


13418 


27319 


A 


13562 


174 


369 


TFFSPSMFVEPGPYYIAQAGVRWLFTGA 
IIVHCSLKLLALSDSPTSAS*VTATTGM 
PPHLAFTVK 


13419 


27320 


A 


13563 


371 


109 


LFSKAGRCILQNLLCCMVSGKCWPMNGI 
DVTREYPTFIAHRVIiTYAATSGSDFLGQ 
AQWLMPVIPAI *EANAGESLEPKSLRLA 
*ATVPGPRSLSQTLLHRSELCGL 


13420 


27321 


A 


13564 


206 


486 


RQLAAVHILVTPLPDHVTWANYVTSLSL 
LSS S ENGVNNDNTHF I GLLLRD YK* GW V 
* WLTP VI PALWEAKVGR IARGQELENS L 
GSKARPRFV 


13421 


27322 


A 


13565 


212 


398 


S RVRG C F P FNL P KS S C I * E CGQGAVAHA 
CNPSTLGGRGGQIA*AQEFKTSQGNVAK 
PHLYKI 


13422 


27323 


A 


13566 


26 


395 


YS P VHTD KCS GVRKLGL F F FF F FE KE FH 
FFSPAWKARAKIGLNGTPLSQGKGNFRP 
KPPKKRGKRGRAPQPGKILNFKKKRGFS 
MGAKVNPNFG P KGN P P P * P P KGAGKKGG 
TPRPGPFFFFF 


13423 


27324 


A 


13567 


400 


195 


ATRVSLAPTKKKINWGWWPTPVI PPPQE 
GEAGEFL* PQKWRVRLNKI VPLPSRPRD 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine> 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














KTRLCFKKKKKN 


13424 


27325 


A 


13568 


316 


154 


SQPFPGLRWEDRLSRGVQNQPKQ*LMPV 
IPAFSGAEVGGSLEPRSSKPA*ATNNLK 
SKVIIGGQICDKASKVSIVESSWWGWGC 


13425 


27326 


A 


13569 


297 


475 


FDYFFIY*ETKSHSAAQAGVQ*CKHSSL 
QP* PPGLKQSLLPWPPKSAGLTGVSYGT 
WPI 


13426 


27327 


A 


13570 


116 


332 


NRVLLLP KGE YRGT I SAHCS LKLLGS KR 
PPTSAA*GDDETTGVCHHTPV* FLKTFF 
IE IRPGYVPRVGHWRF 


13427 


27328 


A 


13571 


233 


405 


T VP E LRAS CRL KKKKKKKKGE KNLiKK I K 
GFPGPGPRV*GPEAKNPGGGGPGIPKGP 
G 


13428 


27329 


A 


13572 


408 


3 


TFCDYFKTPPPGKFFFINGPPSFFLNRG 
PSFFFFPGFPPPFLKNFFFFPLWFKGG 
GYKRGVFPPKKKGPPQPLFFFFFFFLI * 
KKTRPGTVAHPCNPSTLGGQGRWIT*GQ 
E FLS S LiAKMAGRWRPGNCE S V 


13429 


27330 


A 


13573 


273 


406 


I KEFKAERGGHS *NTS TFGGQGRWIA* S 
QEFETSLANMVKPHLY 


13430 


27331 


A 


13574 


385 


38 


KKNSPPPPGKNFFFF*GPPPSPFFFFFF 
FFFFFFFLGGKKTFFFPPPSRFFFFLKP 
FPPKNQGPKNFFPPKKKIGD*ARPPPLT 
GGAKGAPLFKKKKKKKKLKVRPGAVAHT 
CNS 


1 3 A 1 1 

1343 1 


27332 


A 


13575 


230 


53 


HWKVLKENKQ IFTKIS FKMNAI KDSVGQ 
AQWLMP VI PAL * E PDMGGP VE P KS LRP A 
WAT 


13432 


27333 


A 


13576 


406 


33 


VNSIVWVSSPFRVSTFISFFELKSCSVT 
QAGV*WHDLSLLQLQTPGLKRSPAPSRG 
LSFYLSS I FYPHSSQTELFVGTLNAASF 
TNSRLLYLSIiLLCGKYHPLVPSLPLSTI 
WYFSLRVSHHFP 


13433 


27334 


A 


13577 


426 


294 


GGFPSPPVFKPPPQFFFFGPLKKKISPP 
PPGEKKIFF*KAPPP 


13434 


27335 


A 


13578 


268 


3 


LQAKGPAMGE AGARCS SEVWGWPRKGFD 
L * VSR * PS FDSHVR I GRVQRLMPWPVL 
WETEADISFEPCSSRPAWASWRKTYIQK 
PQRI 


13435 


27336 


A 


13579 


395 


49 


BKKKFFVCFPGFPGNPPNPSPQFF*KGL 
IS PSGSLRTRRGFFP * RVFFFFLKKQFP 
LSPRVECNGIIPDP*PPFGEVTSTPQVA 
GAI GPPP P PR VNFYFF VKKGFLNVGPMF 
FFF 




27337 


A 


13580 


68 


466 


GAS P AQG S TLHLVI L P R Y F F KI P T VRTE 
SFSGSL.VTSPPPLHFLPLNKERGGPGIP 
LPCPQIjLQVLGGTSLPPVPSLPDTSQDK 
WPLHGVPPGHVCS*PLAGDGAWPPSPHW 
IPLNPGTSKSLQPAPPWNSA 


1 1 A 1 T 

13437 


27338 


A 


13581 


67 


249 


ATAPGLLCSYKIFSCQLQWCVPVTQIiAW 
"R TT*! A da <3 T >T? 7\. P ^ cj p t , * * tm t t 'D*7TvTDtj r* t 

xu X IIi^OOOXJXiiTJXOOX\.lJ " 1 l v l X 1 ir V INI ixti J_ 

IiAWAT 


13438 


27339 


A 


13582 


359 


3 


KQDSQWVAAAPAS VAVS TCWRGLPLPWS 
GGQNRGKGSLPLQVHRGA*GTENKNQGG 
TPRPGGGPGPSAPRGSSLGAHRKLPVHH 
TTLSSSRSALPPTPRHPAPSSPPCTEEF 
HRTRPI 


13439 


27340 


A 


13583 


174 


1 


FFF F F VE TE SHS VAQARVLE YS GAI P AQ 
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residue of 

peptide 
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Predict- 
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nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














CNLDNL S S S EL PAS AS * AS * VAAXTGT * 
KR 


13440 


27341 


A 


13584 


192 


206 


L.FFLEVGSCYVAQAGFEFPGSGHLPLSA 
S*WGSTGVPHHTQLV*SFSSLYNLERN 
C 


13441 


27342 


A 


13585 


244 


28 


KKPENQEINYIFYHLFNVHEQIRISLGP 
GAVAYNCNPSTLRD*GGRIT*AQEFKTN 
LGNMARPRTRGSTRTF 


13442 


27343 


A 


13586 


312 


701 


RGPEET*PRTRKSSPRPTRSCESVSKLA 
RPPRRCSAAPAQVPRLSLRSPKPDPPRV 
SSAEKCAPLPLLPECSETGALPRNSFLC 
- QNAS S PLLSLrGLPP S PTVQALQPRALHQ 
HLGS TNKEDAHVAP AKKK 


13443 


27344 


A 


13587 


410 


292 


GYM I KQQT VYHE CRM * ANSHLHL P P EGL 
LRAVTLWRRAP 


13444 


27345 


A 


13588 


428 


36 


TPPPFFFFWAQKKKKI FPPPGGKI FFFL 
KGPPPLFFFFFFFFFFFFFFLKKFFGLK 
G P FF F VQD * QGLEN P KEGS S LGWE KLGR 
AliLPQQRSFVPGLGPTRLFWGLVFWFFG 
FFFLPFGLCFFFALIFNC 


13445 


27346 


A 


13589 


3 


364 


TAPDDQGRP YQAGE PAAHVRADS THGLG 
A* RRVAGHLPPRVGAPHHPGASSREGAP 
PLHP P PNL* HRRPNGS CRFLPGPAAPPQ 
GAEGGRGRDRRGTHSVAQTGGPGGFGSG 
VTPSWRSS 


13446 


27347 


A 


13590 


778 


910 


DRIRRSGAVAHTCNPSTLGGQGERTA*G 
QEFKTSLGDMVRFCLY 


13447 


27348 


A 


13591 


230 


3 


KLVNILVEKSLVAMKSTVSEGKMVrRSP 
AS LGM VAHTHS PS TLGGQGRQ I A * AQE F 
ETILGNMAEPCLYKKHSVY 


13448 


27349 


A 


13592 


223 


340 


LTP 1 1 PEPWESEWGS LE * LTPGAVTHA 
YNPRTLGIRSGRIT*GHEFQTTLSHTMK 
TRLYHELL 


13449 


27350 


A 


13593 


35 


220 


DRASLSPRLECSGMIVAHCSINLSGSSD 
PLTPAS * GD GTTD T CHHARL I FTGADFC 
IDASS 


13450 


27351 


A 


13594 


87 


369 


NVQKTECE I SGKMQ I WQ KGS VKTKCSNG 
LFNFP I FSKKI ASCKVKKLTRE * KYKSQ 
ARWCT P VI PATEAE VRGQIiE PRS LRPAW 
ATYQPHFKS 


13451 


27352 


A 


13595 


261 


489 


SWVNERMWG* *KHETSQAQWLMP I IPA 
LCEAEVGGSLEARSWETSLCNIVRLHLS 
KKKRGKGRKRGGNQIAPSRE 


13452 


27353 


A 


13596 


238 


2 


NDCLWWLFRI PANVSTFGLLHMSLKVNT 
PGNNRFKSELGTRCLIHTCNPS ILGGQG 
GW I T * GQ E F VTS MQ KG S AVKNQ 


13453 


27354 


A 


13597 


36 


687 


RDVHRSTYQAGSKQDWGPGEAERLSSSR 
RGAYSCPVP ITFAEGKTRMGRDKMRLIL 
GLSWGPSLTLLLPITNAPRRPGM*EPAL 
PGNSTS * SGAVRFPGGGRK*EDSAESWS 
CFCSHPSPSSGPNPPSPPSAVCMLPGLS 
GLQR*PKQLSPPPALSLPVSSLLVRLSP 
WPPTSSHLLPQALPQSLHPQGSHRAVND 
SFSM*GMVLGSRRNRGCMEA 


13454 


27355 


A 


13598 


367 


468 


KKQRRGRE PWLTPAI PALWKAKAGGS P * 
VQSSRP 


13455 


27356 


A 


13599 


183 


380 


LTSMIA.VDNS S VE KTC PQAMRI SRPGAV 
AHTCNPS I LGGQGGL I A* VQE YETNLGS 



1361 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MAKTHLYKN 


13456 


27357 


A 


13600 


161 


657 


LIQGCWCSLFFSTRVFLAAIGMKSQWCR 
P VAMDLGVYQLRHFS I S FLS S LLGTEN A 
SVRLDNSSSGASWAIDNKIEQAMDLVK 
SHLMYAV*EEVEVLKEQIKELIEKNSQLi 
EQENNLLKTLASPEQLAQFQAQLQTGS P 
PATTQPQGTTQPPAQPASQGSGPTA 


13457 


27358 


A 


13602 


121 


423 


MIKVNSFGLYLYSQVIYIYIMKYYTAIY 
LKDFFFKDGVLTRHPRWNTVTTQCSLE I 
LGSTDPPASAS*AAGTTNTSHHGQLYSY 
FKKRPTYAEMEQYKI KK 


13458 


27359 


A 


13603 


1889 


1717 


VPRIRPLSWTPPSS I *RPKPPSSTFSHP 
GKPSMGPPGLNRCPVQRALLPGWYQHCG 
R 


13459 


27360 


A 


13604 


81 


267 


ETCC I KNNTLK* I * SQMLWCAP I VP AT W 
QAEVGGSLDPRSSSL*CAMITSVNNHCT 
PAWAR 


13460 


27361 


A 


13605 


267 


418 


THASGMVTHTCNPGTLRVSAGRIT*GQE 
FKTSLGNIARPHLYKI I INKY I 


13461 


27362 


A 


13606 


428 


24 


CPESNPRKREPTLKSEPILTLPITGLIL 
DDGNSPTAAAGSVFAETLQPQCSSLCC* 
VKSSPWPTASLPQPFGSAPQTLPVRYAL 
IPGPSLVSTSLGPRLjAPGLPGSPSPPSP 
QVTLDSERYPSPTPCIAPALED 


13462 


27363 


A 


13607 


267 


442 


RP PPTLKVPWLGVE VHACNPRTLGGQGG 
SIS*AQKLENSLSNILRPHLYQKQQKQQ 
KK 


13463 


27364 


A 


13608 


3 


280 


YRLSVICEDPMREREQTCFPPPPSFPPN 
RYCSTRDGGMEGSQGDFRKGFPA*APCR 
SGGGWRGCCSPAGAPGSPLCGSSQGGPV 
SPWGQKRS 


13464 


27365 


A 


13609 


291 


248 


E KRAHGS Q PM KGRERNS MCKS S E FRWA 
WS LAGVKGDGGR * GSGAGGS P KLG * AV* 
LDPEGHAGQAVFCGEAGARTQ I GG KG *R 
RDQAPLPLSPNLCSGSCLPTKHSLARMP 
LRVKSYSSAQLGRTTCTRPSPPTITFDS 
S 


13465 


27366 


A 


13610 


267 


3 


HTDLL YNTP THP I S PRCDPKHTAI PDKQ 
SLLFFFFEMESCSVARAGVRWHDLRSLR 
PLPPRF* KFFCCS I PSLYCGRSRGSSQT 
YTS 
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M 

eth 
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NO: in 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=JLysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


13470 


27371 


A 


13615 


197 


428 


KTMVGLKTSAS FHQDF I Q I PLNHNLSVP 
* TMLG S QPRFLFPP GT VS H I CNPS T LGG 
RGGW IT* GQE FET S LATMVKP 


13471 


27372 


A 


13616 


445 


550 


I S I TRGRARWLTP VILVLWEAEAGGS P * 
GRSSRLP 


13472 


27373 


A 


13617 


3 


435 


TMVLS P ADKTNVKAAWR KYGAHAGE YGA 
EALERMFLSFPTTKTYFPHFDLSHGSAQ 
VKGHGKKE ADALTNAVAH VDDMPNAL S V" 
LSDLHAHKLRVDPVNFKLLSHCLLVTLD 
RHLPAEFTPAVHASLDKFLASVSTVLTS 
KYR 


13473 


27374 


A 


13618 


719 


918 


CEGRRE KWK I GRE RKGRGRRKGEQMGRE 
GKGREDGGRKRPCTSRPRSSSRDRSNII 
ISEHASAVEA 


13474 


27375 


A 


13619 


847 


938 


WLMPAVPALLEAE VGI S LEAQS CRP PWA 
TQ 


13475 


27376 


A 


13620 


349 


409 


NLWS WPGAVAHVCNPS TLGG 


13476 


27377 


B 


13621 


25 


189 


MVLS PAD E TNVKAAWG KVGAHAGE YGAE 
ALERMFLCFPTTMTYFPDFDLIHGSAQ* 


13477 


27378 


A 


13622 


142 


356 


RIVENEKINAEKSSKQKVDLQSLPTRAY 
LDQTWP ILLQGLAVLAKERP PNPIEFL 
ASYLLKNKAQFEDRN 


13478 


27379 


A 


13623 


18 


1353 


AG AAQ CEWS AGE AGARTMS E ADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVGPES IAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDE PVCGRPLG IRAGPNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLS PAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQETVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHL YLGS FRS P FLCRLS LQAV 


13479 


27380 


A 


13624 


18 


1353 


AGAAQCEWS AGE AGARTMS E ADGLRQR 
RPLRPQWTDDDGQAPEAKDGS SFSGRV 
FRVTFLMLAVS LT VPLLGAMMLLES P I D 
PQPLRQ I SGIALFCS FKE PPLLLGVLHP 
NTKLRQAERLFENQLVGPES IAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDE P VCGR P LG I RAG PNGTL F VADA 
YKGL FE VNP WKRE VKLLLS SET P IEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLS PAEDFVLVAETTMAR 
I RRVYVS GLMKGGADLFVENMPGFPDN I 
RPSSSGGYWGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQETVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHL YLGS FRS P FLCRLS LQAV 


13480 


27381 


A 


13625 


1 


384 


QSFRGTGRKRERERKRMSLSDWHLAAKL 
ADQPLTPKS ILRLPETELGE YSLGGYS I 
S FLKQL I AGKLQES VPDPEL IDLIYCGR 
KLLDDQTLDFYGIQPGSTVHVLRKSWPE 
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E=Glutamic Acid, F=OPhenylaIanine, 


sequence 


sequence 




09/515,1 
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location 
correspond 
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amino acid 
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peptide 
sequence 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan 7 Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














PDQKPEPVDKEAAMRD 


13481 


27382 


A 


13626 


980 


1089 


KRIRIQLTGGLYPVPPPHPPPQSPPIFP 
RPTSPTRT 


13482 


27383 


A 


13627 


116 


587 


VCGELRADSWPVPSQPEQASGPQKQAFI 
WPEAPSRSARLPITYTDYWDSRLQTQES 
QMLGSMARKKPRNTSRLPLALNPLKSKD 
VLAVLAERNE AI VP VGAWVE PAS PGS S E 
IPAYTSAYLIEEELKEQLRKKQEALKHF 
QKQ VKYRVNQQ I TLRKQ 


13483 


27384 


A 


13628 


150 


484 


VAFPQASRGVRKTEVGEQQGQGTACGGC 
GHQ C P P P TWH I QRAHC VP S T CGAGGWHL 
QGRAS CRRPPSQS PQVYQTQVGRQDPHF 
GVGRDSRGELAWS SFIHPYLLSVCNPL 


13484 


27385 


A 


13629 


330 


397 


ARCEWLTPVI P AIi WE AEAGGS H 


13485 


27386 


A 


13630 


10 


137 


S FTGAVILI I AHGLTS S LLFCLANSNYE 
RTHNLFFLNDFFFS 


13486 


27387 


A 


13631 


380 


98 


FQWLVS F TATLW FE ERHRKNKI E RE E RR 
AKGERGDRKEERREEGGEERGRRGGERS 
DKRE PKKKS KEESNHPKWKFS FX CS FC 
FLTPFFPVFF 


13487 


27388 


A 


13632 


56 


827 


P L FE AFTACG F VHD CGLL I H P E E TCGL Q 
PISSDYIEAILQSELKRCPSGDMKGQWI 
VPCLSCSDNRTCDWREITWQPHNCQYGV 
LTKPQLQQCLGGRKILFIGDSTNRGIMY 
YLIERLNETLQEWQKVHGTKFYHNVNGG 
KTL I S YSYYPQFW I SPSLRPTFENALEH 
LLQRSRPLENTGQTVLWGGVQWLNSNH 
LQI I HKVLKS P FTTLNQ P VTKS CLQAI Y 
FPRLS PTLHSNCLDLVYS FTKS FNTYFV 
VQFLN 


13488 


27389 


A 


13634 


3 


2718 


SGPCRTTVAPLLRAAPVEHCVAALRPTD 
STMLKKFDKKDEESGGGSNPFQHLEKSA 
VIiQEARVFNETPINPRKCAHILTKILYL 
INQGEHLGTTEATEAFFAMTKLFQSNDP 
TLRRMCYLTIKEMSCIAEDVI IVTSSLT 
KDMTG KE DN YRG P AVRALCQ ITDSTMLQ 
AIERYMKQAIVDKVPSVSSSALVSSLHL 
LKCSFDWKRWVWEAQEAASSDNIMVQY 
HALGIiL YHVRKNDRLAVNKM I S KVTRHG 
LKSPFAYCMMIRVASKQLEEEDGSRDSP 
LFDFIES CLRNKHEMWYEAASAI VNLP 
GCSAKEIiAPAVSVLQLFCSS PKAALRYA 
AVRTLNKVAMKHPSAVTACNLDLENLVT 
DSNRSIATLAITTLLKTGSESSIDRLMK 
QISSFMSEISDEFKVWVQAISALCQKY 
PRKHAVLMNFLFTMLREEGGFEYKRAIV 
DCI I S I IEENSESKETGLSHLCEFIEDC 
EFTVLATRILHLLGQEGPKTTNPSKYIR 
F I YMRWLEHE E VRAGAVS ALAKFGAQN 
EEMLPS I LVLLKRCVMDDDNEVRDRATF 














YLNVLE QKQKALNAGYILNGLTVS IPGL 
E RALQQ YTLE P SE KP FDLKS VPLATAPM 
AEQRTESTPITAVKQPEKVAATRQEIFQ 
EQLAAVPEFRGLGPLFKSSPEPVALTES 
ETEYVIRCTKHTFTNHMVFQFDCTNTLN 
DQTLENVTVQMEPTEAYEVLCYVPARSL 
PYNQPGTCYTLVALPKEDPTAVACTFSC 
MMKFTVKDCDPTTGETDDEGYEDEYVLE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DLEVTVADHIQKVMKLNFEAAWDEVGDE 
PEKEETFTLSTIKTLEEAVGNIVKFLGM 
HPCERSDKVPDNKNTHTLLLAGVFRGGH 
D I LVRSRLiLiLLDTVTMQVTARS LE ELP V 
DIILASVG 


13489 


27390 


A 


13635 


118 


629 


Lt WALQLH P E P E L P S RRGTGAAG VWTG VA 
M IRF I L I QNRAGKTRLAKW YMQ FDDDE K 
Q KL I E E VHAVVT VRDAKHTN F VE FRNF K 
1 1 YRRYAGLYFCI CVDVNDNNLAYLEAI 
HNFVEVLNEYFHNVCELDLVFNFYKVYT 
WDEMFLAGEIRETSQTKVLKQLLMLQS 
LE 


13490 


27391 


A 


13636 


57 


221 


LHHCTPPWAEVEETLKRLQSQKGVQGI I 
WNTEGGWEPLGHCGDRSRPPAQGCP 


13491 


27392 


A 


13639 


344 


544 


LSGGHAGALSSLWVHLYCLLSSQQLLGN 
VLVTVLAIHFGKEFTPEVQASWQKMVTG 
VCSALCFRYH 


13492 


27393 


A 


13640 


319 


623 


DMEEAS EGGGNDRVRNLQS E VEGVKNI M 
TQNVERILARGENLEHLRNKTEDLEATS 
EHFKTTSQKVARKFWWKKVKMIVIjICVI 
VFIIILFIVIjFATGAFS 


13493 


27394 


A 


13641 


2099 


769 


trlagrvs vasrpcrgpavggllvers k 
arrpllesrvamaavpellqqqeedrsk 
lrsvsvdlnvdpslqxdipdalserdkv 
kftvhtkttlptfqspefsvtrqhedfv 
wlhdtliettdyagli ippaptkpdfdg 
prekmqklgegegsmtkeefakmkqele 
aeylavfkktvsshevflqrlsshpvls 
kdrnfhvfleydqdiisvrrkntkemfgg 

FFKSWKSADEVLFTGVKEVDDFFEQEK 

nflinyynrikds cvkadkmtrshknva 
ddyihtaaclhslaleeptvikkyllkv 
aelfeklrkvegrvssdedlklteldry 
ymln i e aakdll yrrt kal i d yensnka 
ldkarlkskdvklaeahqqeccqkfeql 
s e s akee l infkrkrvaafrknl i emse 

LE IKHARNNVSLLQS C IDLFKNN 


13494 


27395 


A 


13642 


210 


772 


SVKMVRYSLDPENPTKSCKSRGSNLRVH 
F KNTRE TAQAI KGMH I RKAT KYLKD VT L 
QKQCVP FRRYNGGVGRCAQAKQWGWTQG 
RWPKKSAEFLLHMLKNAESNAELKGLDV 
DS LVI EHI QVNKAP KMRRRT YRAHGRIN 
PYMSSPCHIEMILTEKEQIVPKPEEEVA 
QKKKISQKKLKKQKLMARE 


13495 


27396 


A 


13643 


168 


2172 


S PLCE VS VP S FCFRVFCKKHKKHKSDKH 
LYEEYVEKPLKLVLKVGGNEVTELSTGS 
SGHDSSLFEDKNDHDKHKDRKRKKRKKG 
EKQIPGEEKGRKRRRVKEDKKKRDRDRV 
ENEAEKDLQCHAPVRLDLPPEKPLTSSL 
AKQEEVEQTPLQEALNQLMRQLQRKDPS 
AFFSFPVTDFIAPGYSMIIKHPMDFSTM 
KEKIKNNDYQSIEELKDNFKLMCTNAMI 
YNKPETIYYKAAKKLLHSGMKILSQERI 
QSLKQS IDFMADLQKTRKQKDGTDTSQS 
GEDGGCWQREREDSGDAEAHAFKSPSKE 
NKKKDKDMLEDKFKSITNLEREQEQLDRI 
VKESGGKLTRRLVNSQCEFERRKPDGTT 
T LGLLH P VD P I VGE P G YCP VRLGMTTGR 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQ S G VNTLQG F KE D KRNKVT P VL YLN YG 
PYSSYAPHYDSTFANISKDDSDLIYSTY 
GEDSDLPSDFS IHEFIATCQDYPYVMAD 
SLLDVLTKGGHSRTLQEMEMSLPEDEGH 
TRTLDTAKEMEQITEVEPPGRLDSSTQD 
RL I ALKAVTNFG VP VE VFD S E E AE I F QK 
KLDETTRLLRELQEAQNERLSTRPPPNM 
ICLLGPSYREMHLAEQVTNNLKELAQQV 
TPGD I VSTYGVRKAMGI S I PS P VMENNF 
VDLTEDTEEPKKTDVAECGPGGS 


13496 


27397 


A 


13644 

• 


1032 


207 


PADVTPKPATEAVQSEHSDASPMSINEV 
ILS ASGACKL IDSLHSYCFS SRQNKSQV 
CCLREQVEKKNGELKSLRQRVSRSDSQV 
RKLQEKLDELRRVSVPYPSSLLSPSREP 
PKMNPWEPLSWMLGTWLSDPPGAGTYP 
TLQPFQYLEEVH I SHVGQPMLNFSFNS F 
HPDTRKPMHRECGFIRLKPDTNKVAFVS 
AQNTGWEVEEGEVNGQELCIASHS IAR 
ISFAKEPHVEQITRKFRLNSEGKLEQTV 
SMATTTQPMTQHLHVTYKKVTP 


13497 


27398 


A 


13646 


148 


380 


RGSWREVPESASLPSRGAKGKWRGLCCC 
CCCCCCCCCCCCYHCHQEQGDLKHQADL 
WRSGRTQNQAGIWQEHQTLEG 


13498 


27399 


A 


13647 


2099 


769 


TRLAGRVS VAS R P C RGP AVGGL L VE R S K 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDUSIVDPSLQIDIPDALSERDKV 
KFT VHTKTTLPTFQS PE FS VTRQHED F V 
WLHDTLIETTDYAGLIIPPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AE YLAVFKKTVS SHE VFLQRLS SHPVLS 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
FFKSWKSADEVLFTGVKEVDDFFEQEK 
NFL INY YNRI KDS CVKADKMTRSHKNVA 
DD Y I HTAACLHS LALEE PT VI KKYLLKV 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLN I E AAKDLL YRRTKAL I DYENSNKA 
LDKARLKSKDVKLAEAHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LEIKHARNIWSLLQSCIDLFKNN 


13499 


27400 


A 


13648 


1 


1206 


MSTSQS PCES I CD YVTSHDKSNFTDMI K 
LIST I LRCEVI LDYPAPGGGSLGAKHCCS C 
YTVS S GVTEGERNAGEKGVKLNADGAR I 
RGTPGRGRRAEAE AS S PAP AAVAAAC W 
AAAAAS RQLASGNRTRVS S GVPAPAFLG 
TMNPNCARCGKIVYPTEKVNCLDKFWHK 
ACFHCETCKMTLNMKNYKGYEKKPYCNA 
HYPKQSFTMVADTPENLRLKQQSELQSQ 
VRYKEEFEKNKGKGFSWADTPELQRIK 
KTQDQISNIKYHEEFEKSRMGPSGGEGM 
EPERRDSQDGSSYRRPLEQQQPHHIPTS 
APVYQQPQQQPVAQS YGGYKEPAAPVS I 
QRSAPGGGGKRYRAVYDYS AADEDEVS F 
QDGDT I VNVQQ I DDGWM YGTVERTGDTG 
MLPANYVEAI 


13500 


27401 


A 


13649 


3 


394 


GDGGGHLGSGRNGGGSMNAPPAFESFLL 
FEGEKITINKDTKVPNACLFTMNKEDHT 
LGNI I KSQLLKDPQVLFAGYKVPHPLEH 
KIIIRVQTTPDYSPQEAFTNAITDLISE 



1366 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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LSLLEERFRVRAGPGGAD 


13501 


27402 


A 


13650 


1 


471 


SRPTGLREAAGSGPREAPRRSGCKSPGL 
GTVGMLRPKALTQVLSQANTGGVQSTLL 
LNNEGSLLAYSGYGDTDARVTAAIASNI 
WAAYDRNGNQ AFNEDNL KF I LMDCMEGR 
VA I TRVANLLL CM YAKE TVG FGMLKAKA 
QALVQYLEEPLTQVAAS 


13502 


27403 


A 


13651 


1334 


82 


CYTGGTQSLWPGSSCASSVARPSSLFRS 
AWSCEWSVRCARACTMSVPAFIDISEED 
QAAE LRAYL KS KGAE I S E ENS EGGLHVD 
LAQI I EACD VCLKEDDKDVES VMNS VVS 
LLL I LE PDKQEAL I ESLCEKLVKFREGE 
RP S LRLQLL SNL FHGMD KNT P VR YT VY C 
SLIKVAASCGAIQYIPTELDQVRKWISD 
WNLT TE KKHT LiLRL L YE AL VD C KKS DAA 
SKVMVELLGS YTEDNASQARVDAHRC I V 
RALKDPNAFLFDHLLTLKP VKFLEGEL I 
HDLLTI FVSAKIiAS YVKFYQNNKDFIDS 
LGLLHEQNMAKMRLLTFMGMAVENKE I S 
FDTMQQELQ I GADD VE AF V I DAVRTKM V 
YCKIDQTQRKVWSHSTHRTFGKQQWQQ 
LYDTLNAWKQNLNKVKNSLLSLSDT 


13503 


27404 


A 


13652 


1 


377 


TTASGRSGVKGSTMSAEVPEAASAEEQK 
E ME D KVT S PE KAE E AKL KAR Y PHLGQKP 
GGSDFLRKRLQKGQKYFDSGDYNMAKAK 
MKNKQLPTAAPDKTEVTGDHIPTPQDLP 
QRKPSLVASKLAG 


13504 


27405 


A 


13653 


424 


596 


SLKNI YGLSCRKKKKGAVKKI ILVQAWW 
LMPVI TVLWEAE VGGLLEARGLRPTRAT 
W 


13505 


27406 


A 


13654 


190 


448 


LRSYPAPHLGSPELRIRKGRGHSHCIoAG 
AAGP QRTALCGL S AP LC P P S P T P P GAGA 
PRYCSGSDAPPCLLRGAGPPIPGMGDPE 
TS 


13506 


27407 


A 


13657 


1612 


563 


SMPGWRLLTQVGAQVLGRLGDGLGAALG 
PGNRTHIWLFVRGLHGKSGTWWDEHLSE 
ENVP F I KQLVS DEDKAQLAS KLCPLKDE 
P WP IHPWE PGS FRVGLI ALKLGMMPLWT 
KDGQKHWTLLQVQDCHVLKYTS KENCN 
GKMATLS VGGKTVS RFRKATS I LE FYRE 
LGLPPKQTVKI FN I TDNAAI KPGTPLYA 
AHFRPGQ YVDVTAKT I GKGFQGVMKRWG 
FKGQPATHGQTKTHRRPGAVATGD IGRV 
W PGT KMPG KMGN I YRT E YGLKVWR I NTK 
HNI I YVNGSVPGHKNCLVKVKDSKLPAY 
KDLGKNLPFPTYFPDGDEEELPEDLYDE 
NVCQPGAPS ITFA 


13507 


27408 


A 


13658 


128 


2626 


NSHRWVYVRARRWRRRGKQREQPEDRGV 
PMKRAAMALHSPQY I FGDFS PDE FWQFF 
VTPRS S VE L P P YS GT VLCGTQAVD KLPD 
GQEYQRIEFGVDEVIEPSDTLPRTPSYS 
ISSTLNPQAPEFILGCTASKITPDGITK 
EASYGS IDCQYPGSALALDGSSNVEAEV 
LENDGVSGGLGQRERKKKKKRPPGYYSY 
LKDGGDDS ISTEALVNGHANSAVPNSVS 
AE DAE FMGDMP P S VT PRTCNS P QNS TD S 
VSDIVPDSPFPGALGSDTRTAGQPEGGP 
GADFGQSCFPAEAGRDTLSRTAGAQPCV 
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GTDTTENLGVANGQ I LESS GEGTATNGV 
ELHTTESIDLDPTKPESASPPADGTGSA 
SGTLPVSQPKSWASLFHDSKPSSSSPVA 
YVETKYS P PAI S PL VS EKQVEVKEGLVP 
VSEDPVAIKIAELLENVTLIHKPVSLQP 
RGL INKGNWCY INATLQALVACP PMYHL 
MKF I PL YS KVQRP CTSTPM IDS FVRLMN 
E FTNM P VP P KPRQ ALGDK I VRD I RP GAA 
FEP T Y I YRLLT VNKS S LSE KGRQEDAE E 
YLG F I LNGLHE E MLNL KKLL S P S NE KL T 
ISNGPKNHSVNEEEQEEQGEGSEDEWEQ 
VGPRNKTS VTRQADFVQTP I TGI FGGHI 
RSWYQQSSKESATLQPFFTLQLDIQSD 
KIRTVQDALESLVARESVQGYTTKTKQE 
VEISRRVTLEKLPPVLVLHLKRFVYEKT 
GGCQKLIKNIEYPVDLEISKELLSPGVK 
NKNFKCHRTYRLFAWYHHGNSATGGHY 
TTD VFQ I GLNGWLR I DDQT VKV I NQ YQ V 
VKPTAERTAYLLYYRRVDLL 


13508 


27409 


A 


13659 


42 


382 


T LWLKT I Q I YLT I S LGLGS D YGLAGFS A 
LGCHQAAVKMLTAFILIQGLDLGRSHFQ 
AHSGCWQDSFPCDSRIHGGLLLHVQQES 
P WFQS AERVS C I TKCNHRSDTHHLCS I L 

L 


13509 


27410 


A 


13660 


178 


349 


DMGPCYVAQTVLKLLGSSDPPTSASQEA 
GTTCHHAHLLSHPFLMIYRNFSFIQCLE 
T 


13510 


27411 


A 


13661 


268 


2 


RQQKVILSSSGSLMSEMGRTVLPLKSPG 
MNPSSPLLASGGCWKSLACGHIIPFSSF 
ILMWPSPLCVSVSSSLPMRKPLLLHLRS 
TLFQ 


13511 


27412 


A 


13662 


112 


483 


AGVGALRMVQRLTYRRRLSYNTASNKTR 
LSRT PGNR I VYL YTKKVGKAP KS ACGVC 
PGRLRGVRAVRPKVLMRLSKTKKHVSRA 
YGGSMCAKCVRDRIKRAFLIEEQKIWK 
VLKAQAQSQKAK 


13512 


27413 


A 


13663 


2 


873 


S VE E FDRG CTGRGCGADARAGAAMVK I S 
FQPAVAGI KGD KAD KAS AS APAP AS ATE 
I LLT P ARE E Q P PQHR S KRGS S VGGVCYL 
SMGMWLLMGLVFASVYIYRYFFLAQLA 
RDNFFRCGVLYEDSLSSQVRTQMELEED 
VKI YLDENYERINVPVPQFGGGDPADI I 
HDFQRGLTAYHDISLDKCYVIELNTTIV 
LPPRNFWELLMNVKRGTYLPQTYI IQEE 
MWTEHVSDKEALGSFIYHLCNGKDTYR 
LRRRATRRR I NKRGAKNCNAI RH FENT F 
WETLICGW 


13513 


27414 


A 


13664 


118 


3 


AWSL I P VI P WRE AKAGGLLE PR S LRPT 
WATWQDPVST 


13514 


27415 


A 


13665 


1 


2876 


IRQRINFSRKNKWDSRKLDQAGVSELAT 
NQ KL I L VCGR YE G I DE RVI QTEIDEEWS 
IGDYVLSGGELPAMTLIDSVSRFIPGVL 
GHEAS ATEDS FAEGLLDCPHYTRPE VLE 
GMEVPPVLLSGNHAE IRRWRLKQSLGRT 
WLRRPELLENLALTEEQARLLAEFKTEH 
AQQQHKHDGMAADEAGRTFLRADFNMIE 
EGDRIMVCLSGGKDSYTMLEILRNLQQS 
AP I NFS LVAVNLDQ KQ PG FP EHVL P E YL 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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E KLGVE YK I VEENT YG I VKE KI P E GKT T 
CS LCS RLRRGI L YRTATELGATKI ALGH 
HRDDILQTLFLNMFYGGKMKGMPPKLMS 
DDGKHIVIRPLAYCREKDIQRFADAKAF 
P I I P CNLCGS QPNLQRQVI ADMLRD WD K 
RYPGRIETMFSAMQNWPSHLCDTNLFD 
FKGITHGS E WNGGDLAFDREE I PLQPA 
CWQ P E E DENQLDE LRLNVVE P DG P RHRH 
APDARRFCQYLRHQARYLLSWPNLDVA 
VTE PLANGDGLNVL I KRE WGFRANT VE 
KTGENQ YRVW PNEM P AD LHK I R PHHPLN 
RNLiDHF F P LDLTNS T F S GG YVHVL KGVL 
SDDLLLKSFQKMGYVRRDSHRLMVTALP 
PACQLVQVALGC FALRL.ECE I LGE VLAQ 
LGTS VLPAEELLQARRASGDVAS CQRLA 
QDEEPPPLPPRGSPAAYRAPLDLYRDLQ 
EDEGSEDASLYGE PS PGPDS PPAELAYR 
PPLWEQSAKLWGTGGRAWEPPAEELPQA 
SSPPYGALEEGLEPEPSAFSFLSLRREL 
SRPGDLATPESSAAASPRRIRAEGVPAS 
AYRSVSEPPGYQGTQLPVPWRPAHPLLR 
HLSPAACCPLCSPARLPSRPLAACAAWR 
R P AAL VATACTDGH S ALQ QP RG PALGQL 
GPRSRALGRQLPWCFSLLGRGLGLWLPC 
PWGSPGHSRGLESPGQPLLEVGLSVYGR 
LQGQQA 


13515 


27416 


A 


13667 


76 


289 


SGTPASPCLEMDPNCSCSPVGSCACAGS 
CKCKECKCTSCKKSCCSGCPVGCAKCAQ 
GG I CKGTSDKGSCCA 


13516 


27417 


A 


13669 


2 


667 


GRVDAE Q S RLGATE RAAAAAMNP E YD YL 
FKLLLIGDSGVGKSCIiIiLRFADDTYTES 
YISTIGVDFKIRTIELDGKTIKLQIWDT 
AGQERFRTITSSYYRGAHGIIWYDVTD 
QES YANVKQWLQE IDRYASENVNKLLVG 
NKSDLTTKKWDNTTAKEFADSLGIPFL 
ETS AKNATNVEQAFMTiyiAAE I KKRMGPG 
AASGGERPNLKIDSTPVKPAGGGCC 


13517 


27418 


A 


13670 


176 


398 


RIIiKTQLQENNQPTTTTKNRQKTRTDTL 
PKGIYRRHISLWKLGTWLTPVIPALCEA 
NAGGRLE SRS SRPVWAT Y 


13518 


27419 


A 


13671 


149 


247 


RKGLALSPRLDCSGAITVNCSLNLQGLS 
NIPI 


13519 


27420 


A 


13672 


257 


359 


VAS LHGWAQWLTP VI PTLWKTKVGEFFE 
PRCSRP 


13520 


27421 


A 


13673 


142 


1 


HL KGSGLAKHQ WLM P V I PALWEAEVGGS 
LEPRSSRPVWATLQDPISS 


13521 


27422 


A 


13674 


619 


168 


M F I E S C CT P K I YT S F VLRQ SHS FAQAGV 
QWLDLGSLQPLSLGS SNSPASAS C I QVL 
FVI YTSVKLGVGGRLGNKKVMS SNKKFV 
CECRGWI ILPFSVPSLRFQDGGWDHGVS 
SAE KALQAGRTAGHRVGVPPLGAP CGGA 
PCTTPPAWRS 


13522 


27423 


A 


13675 


3 


370 


ARALLHPRLHQWQCLCHLQGTGGCPQQS 
PS S EGRADLLS PALEEG YPPPASGCFI P 
HFL CGCS I FNCLP P I MRHAHKS PDALLE 
TLGPCRAWRLMP VI PTLWEAKVDRLLES 
RSSRPSWPTW 


13523 


27424 


A 


13676 


3 


281 


IGLRPELSVSLGRQRLTAIRLQPLLPAR 
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TPCGFLRARVALALMADETDLPLLPRLE 
YSGMITAYCSLDFLGSSDPPTLGSGVAG 
TTENCCED 


13524 


27425 


A 


13677 


272 


389 


YCRQ I E KS KKKRCWLGT VAHACNPS TWG 
GRGGRITRSRD 


13525 


27426 


A 


13678 


261 


363 


NSSWGHARWLTSVIPALWEAEVGGSPEV 
RSLRPV 


13526 


27427 


A 


13679 


149 


44 


S CVNS VS VLKKM YWLGAVAHACNPSTLG 
GQGGRTA 


13527 


27428 


C 


13680 


118 


267 


MSHCTWLMCVCVCVCVCGVYAYMYTYS F 
LRWDLPMLSRLECSGYSQAQ* * 


13528 


27429 


A 


13682 


16 


346 


NHHLTQPHPHLDINDFFFFFFFFFFFFF 
FFFFFFFFFFFLKKSPRPAGKKKKKKKK 
KKKPPRFFFLGGAPPPLFFFLINFFFFF 
L CK 1 1 S L V YNRRGG P CL S KRF FL KE RKS 
PL 


13529 


27430 


A 


13683 


244 


140 


GRVD VFHHVAQ AGL KLL S SGDLPAWP PK 
ELRLQV 


13530 


27431 


A 


13684 


191 


777 


NSDEHVYRCLYGHKLSFLIiSKYLKSGLA 
GVLLPLDAAVDMEKIEEQFANLHIVKCS 
LGTKE PT YLLGI DTS KTVQAGKENLVAV 
LCSNGSIRIYDKERLNVLREFSGYPGLL 
NGVRFANS CDS VYS ACTDGTVKCWDARV 
ARE KPVQL FKGYPSNI F I S FD INCNDH I 
I CAGTE KVDDDAL FGGFGDARME FLQN 


13531 


27432 


A 


13685 


150 


253 


VGWGLWL VP V I PAF WE AKMGES LDPRNL 
LPCWAT 


13532 


27433 


A 


13686 


500 


321 


IEMGSHYVAQAGLELLSSSNPPTSVSQS 
AGIVGMSHHALAAI SKSSDAS I TSHHPM 
TTP 


13533 


27434 


A 


13687 


6 


424 


MSLLQCGGIRCFKMPEPAKSAPAPKKGS 
KKAVTKAQKKDGKKRKRSRKESYSVYVY 
KVLKQVHPDTG I S S KAMGIMNS FVND I F 
E R I AGE AS RLAH YNTKRS T I T S RE I QTAV 
RLLL PGE LiAKHAVS EGTKAVTKYTS S K 


13534 


27435 


A 


13688 


178 


293 


TGYS S QAQWLMTVLTATWE AEAGGSLQ P 
RSLRPAWATQ 


13535 


27436 


A 


13689 


663 


144 


KELSAVSAGIPHSCGSQGCGGGSVAACV 
P AAPAAAGL C S GRAQ KVP P P P S LAGWP P 
GVNAPPPPVCSSVRLHVCQSDRLWRLA 
ARRGI LALLRSALKAATLAGCQS VRWS V 
RPSESLRPTSNAASLFRSSVPTVLSHSV 
PLiAASLGKRRACGGREHASVAVYLSVCL 
SLPT 


13536 


27437 


A 


13690 


125 


3 


FAHQGHAPGQAWWLRP VI PVLREAELGG 
RPEPRSSRTARA 


13537 


27438 


A 


13691 


136 


24 


GGPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFPGKLQQDFQYYVK 


13538 


27439 


A 


13692 


144 


1 


MLKIVQSGEGLTLKFKNFCLLSLCTLFP 
TLIALTTLLLPISPFILIIL 


13539 


27440 


A 


13693 


1476 


380 


NSWSTItAS ELTLWAATSAD PERKS QAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNS LE KFH FKS WMS KHRKT YS TEE YHHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
SDMSFAEIKHKYLWSEPQNCSATKSNYL 
RGTGP YP P S VD WRKKGNF VS P VKNQGAC 
G S CWTF S TTGAIiE S AI AI ATGKMLS LAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYIL 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, ENHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
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W=Tryptophan, Y=Tyrosine, 
X-Unknown, *-Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANI T I YDEEAMVE AVALYNPVS 
FAFE VTQDFMMYRTGI YS STS CHKTPDK 
WHAVLAVGYGEKNGIPYWIVKNSWGPQ 
WGMNGYFLI ERGlOSnyiCGLAACAS YP I PL 
V 


13540 


27441 


A 


13694 


87 


2 


PGWAWWFTPVILALWETEAGGSPEFKSL 


13541 


27442 


A 


13696 


130 


2 


CNILIII IQ VCWAWWLMP VI PALWHAE P 
GGSPEVESSRPALV 


13542 


27443 


A 


13697 


282 


107 


GLTKCPGTVCRSLKWI S YQLS I ICLSM 
MEHSGTILAHCSLKLLGSSDPPTSVSRV 
AS 


13543 


27444 


A 


13698 


198 


3 


CSITVNKVKVMTLFRSSMSLEIVCDIDL 
AVLRRDLGSLQDPPPRFTPLSCLSLPSS 
WDYRRPLV 


13544 


27445 


A 


13699 


2 


120 


ARGDGVS LWLPRLKCNGAI S AHCNLRP T 
GSSDKNFKTRF 


13545 


27446 


A 


13700 


60 


341 


PDMGLEDERKMLTESGDPEEEEEEQEEL 
VD P L TT VRE Q C E Q L QKC VKARERLE LCD 
ERVSSRSHTDDDCPEELFDFLHAKDHCV 
ALKLFNNLQ 


13546 


27447 


A 


13701 


6 


540 


KNSRTLGQCGGIRGSLCRPRKPGVGGTQ 
TRAVRPAVCSADSARPHLPLRRADMKDS 
LVLLGRVPAHPDSRCWFLAWNPAGTLLA 
SCGGDRRIRI WGTEGDSWI CKS VLSEGH 
QRTVRKVAWSPCGNYLASAS FDATTCIW 
KKLTLRIYNILRKLEACIKPLCCALKYG 
CLEEKQLHS 


13547 


27448 


A 


13702 


239 


346 


AKRGWLMPVI PALWEAKVGKSLEPRI SR 
PLWATWQN 


13548 


27449 


A 


13703 


210 


300 


LMPVI S ALWEAKARGLLE PRTLRPAWAT 
W 


13549 


27450 


A 


13704 


141 


440 


PSAFEHFEKINLYFLKFCISQGFPERQN 
NRDRE IHRERYERDRERHRERYERGLRE 
LAHVIVEAEKPHHRPFITWETLGCWWSG 
SVQVQKPWEPGKLMV 


13550 


27451 


A 


13705 


204 


438 


LLNVLLTQLFFLFLSLRQSCSVAQARGQ 
WYNHGSDQP S THGASNP PTSAS QSVGTT 
GMSHDHGLFLHFS TLD I FSSL 


13551 


27452 


A 


13706 


1707 


1821 


AKAGGS QHLE I LANAVKP CL YWECKMAG 
PWWCAPVGG 


13552 


27453 


A 


13707 


167 


334 


IRRANFKI LTE IGWTQWLTP VI PALWE A 
EGGGLLE PG S P S L VFLF F I LTT I LMRKK 


13553 


27454 


A 


13708 


224 


344 


KDTAMEE E I KDTE KTGKP WLGAVAHACN 
PSTLGDQGRQIA 


13554 


27455 


A 


13709 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNS LE KFHFKS WMS KHRKT YS TEE YHHR 
LQT FAS NWRKI NAHNNGNHT FKMALNQ F 
SDMSFAEIKHKYLWSEPQNCSATKSNYL 
RGTGP YP PS VD WRKKGNF VS P VKNQGAC 
GS CWTFS TTGALE S AIAI ATGKMLSLAE 
QQLVDCAQDFNNHGCQGGLP S QAFE YI L 
YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANIT I YDEEAMVEAVALYNPVS 
FAFEVTQDFMMYRTGIYS STS CHKTPDK 
VNHAVLAVG YGE KNG I P YW I VKNS WG P Q 
WGMNGYFLIERGKNMCGLAACASYPI PL 
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V 


13555 


27456 


A 


13710 


763 


257 


YEKILKLTADAKFESGDVKATVAVLSFI 
LSSAAKHSVDGESLSSELQQLGLPKEHA 
AS L CR C YE E KQ S PLQ KHLR VCS LRMNRL 
AGVGWRVDYTLSSSLLQIRGKSPWCHCG 
WKVATCPQGPQAPACLPCPFS S KTKFPR 
SSLARTESRPKTLMKLPWAEEKGVPRPV 


13556 


27457 


A 


13711 


133 


401 


VLSKSGNPYWGLSTNHEIKGQEGCAWQIi 
TPVIPALWEAEAGGSLEPRSLRPAWATS 
WLPRE I KDDLLKNGCGPSKI TQGGPMVA 
GSLKL 


13557 


27458 


A 


13712 


212 


74 


RRINLAPPRVFLKGPPFFFFFFFFFFFF 
FFAWGNPLFLFNSTFGRT 


13558 


27459 


A 


13713 


324 


141 


D R VL PRL E CSG VI TAR YRLNL LG S RDP P 
T S AS L I AGS TGTCHHAWLS FS YLFTYVF 
FIIL 


13559 


27460 


A 


13714 


299 


431 


I RL I WLS AVDHACNS S TLGGHGVQMTKG 
QDFETNLANTANPHPY 


13560 


27461 


A 


13715 


133 


1 


ILVWAWWDMPAIPVIiWEAETEGLLEPRS 
LRPAWAT W KNT 1ST KN 


13561 


27462 


A 


13716 


18 


165 


I PLHTHTHTHTHTHTHGYLLALSGTEFL 
SVLYMASERECRTYNLMLNK 


13562 


27463 


A 


13717 


119 


1 


KPNVSQAQWLAPVI PALWEARVGVLFKP 
RSLRPTWATQQ 


13563 


27464 


A 


13718 


271 


64 


VLRS I S TLNE ILAKTRNPPLQ I LMKPRK 
RWGTVAHLRAPVGDQPDQHSETPSLLKI 
TKISWVWWRTPVI 


13564 


27465 


A 


13719 


21 


209 


KDGPKGTGGSETNESVEMTECCSVTQAG 
VQWHDLGSLQPPPPGFKRFSCLGLPSSW 
DYSRND 


13565 


27466 


A 


13720 


391 


151 


RGFFLKGEKKKNFFPIILGKKILGSPGK 
KGEKKKKKRGKIFFFLKNPLGFFPKKKV 
LGEKKNPYSGVWGGKKKRGPPPHP 


13566 


27467 


A 


13721 


109 


2 


RRLGWAHACNP S TLGGE VGGS QGPGNC 
GQVTKVS 


13567 


27468 


A 


13722 


69 


369 


RDILQADGAHSLFPTTPSLNAYIIFSSP 
IGPHTHRPYHATPTYLPFYTNNLIKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGPF 
KKTPGGAP I FGGGGK 


13568 


27469 


A 


13723 


58 


1208 


F WNENS PAS ELAPNGGGS VTS VPRLED Y 
LTRPQLYKLTGVEGPSRANSRDS FHDRK 
TYKTPSANMMVLKVEELVTGKKNGNGEA 
GE FL P E D FRDGE YE AAVTLE KQE DLKTL 
LAHP VTLGEQQWKS E KQRE AELKKKKLE 
QRSKLENLEDLE III QLKKRKKYRKTKV 
PWKEPEPEIITEPVDVPTFLKAALENK 
LPWEKFLSDKNNPDVCDEYKRTALHRA 
CLEGHLAIVEKLMEAGAQIEFRDMLEST 
AIHWAS RGGNLDVLKLLLNKGAKI SARD 
KLiLSTALHVAVRTGH YE CAEHL I ACEAD 
LNAKDREGDTPLHDAVRLNRYKM I RLLI 
MYGADLNI KNCAGKTPMDLVLHWQWGTK 
AI FDSLRENS YKTSR IATF 


13569 


27470 


A 


13724 


400 


135 


KWNGKELNGVDG I GWDWNEM YWKGLE FH 
E TE WNE LERNGLE WNGM S WDG I KCNG I K 
WNGIECSGMLWNGMECIGMECPLMEWIR 
VEWN 


13570 


27471 


A 


13726 


186 


344 


S ASLGLWRCRDCRRS LVHS VLNVAQAGV 
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QWRDLSSLQPPPPSRLPWPPKVLR 


13571 


27472 


A 


13727 


380 


168 


APPQKKVFSSKFPQGFFFAPPNRKKLFF 
SPPRKIVPPPKFFLiWAPPLFFFFFFFFF 
FFFFFFWYSRTDYK 


13572 


27473 


A 


13728 


324 


219 


RIMFFI FFYFFFFFFFFVFFLFFFFFKR 
PPPFFY 


13573 


27474 


A 


13729 


2 


270 


RLQEASLSPVPEILGGALPSAKRTSYLL 
LSTLCLLLSETASRGNSLTGLGHRSDHY 
NS VRS GGQSLYSACP I VTT IHGTCYGGK 
AKCCK 


13574 


27475 


A 


13730 


134 


3 


AVHRCKKWGRGWGRWLTP VI PTL.WEAE V 
GGPLEPRSLRPALAT 


13575 


27476 


A 


13731 


429 


641 


RKKAVCFMNDLICFLDNTFKNNVLSQAW 
W CVHLVP T I WE AE AGGSLE PRS LKLQ C P 
WAPVMCTPAWAT 


13576 


27477 


A 


13732 


188 


32 


I FFFFLFFYFLFFFFFFFPRSRHCTLQP 
GQQSETRSQNKKQKTKNKQTKNLP 


13577 


27478 


A 


13733 


341 


3 


GTVTPEEPKKMGDQTPRGLPRPERTLGR 
LSTSESRVDPTPRVGGGSSGGRTAPWGP 
PRERGRVAGS S LHRACNFS TSNT 1 1 SVG 
LWLTPVIPTLGEAEAGGSPEVRSSTHAS 


13578 


27479 


A 


13734 


208 


2 


KL YKNFF FRFTKNP P PLLKKGF YL YNF F 
F KKKKK I VKKKKKKKKKKKKQ KKKKKKK 
GRPLAI KNYPRV 


13579 


27480 


A 


13735 


16 


212 


E G VS LKL PRL E CRGT I S AH CNLRHRGRR 
CTPASASQILRRLKQENHLNLGGGGYRD 
PIKLQRTAS 


13580 


27481 


A 


13736 


22 


214 


P DRPAL P TRPGGW I TWGRE VRDQ P GQ CG 
ETPSLLKIEKLAGCDGTCLEQNCLQNNN 
SNNKVGIR 


13581 


27482 


A 


13737 


386 


51 


WGNSITQQALFEHLCFSGHTSLPAVPCP 
HQACFPTLPLICRGI I IAHCNLGLLGSS 
DS PI S AS P I DGNTGLHPHARL FS FLLWY 
RWGLTMMPKRSLSPGLKPSSRLADAWV 


13582 


27483 


A 


13738 


204 


431 


S QHFG I S RRADHLS SNVGDQ PEQHGKNS 
LYKYKKINQGWAWWLTPVIPPLWEGQVG 
GS PEGKELKTPLRNLGKPHL 


13583 


27484 


A 


13739 


69 


159 


I KKEALS WAWWLTPVI PAL W KANf AGGS P 
E 


13584 


27485 


A 


13740 


32 


285 


FFLSFFLFFFFFFGGGGKKKTKKFFSQD 
FLGPPKILGGGGFFSFFFGGPFFLKKGK 
KSPQNLFFLKKNPPPFFFPPPKKPLWGF 


13585 


27486 


A 


13741 


372 


213 


YRSCVLLQQNPPLFYQSWWNFSLCVCVC 
VCVCVCVCETGSLKKFLiLTQKKKR 


13586 


27487 


A 


13742 


29 


237 


' T KRKG YKTNKG YWAWAW WLT P V I PALFI 
AEGNLILQMQKQAQPEAGCGCVCVCVCA 
CMCVCVCVFKVFF 


13587 


27488 


A 


13743 


322 


183 


HTHTHTHTHTHTHTHTNPIHVFWLLLKY 
PKIQSHQFHISAFQDPLL 


13588 


27489 


A 


13744 


305 


373 


VGWGWWLTPVI PALWEAKVGGS 


13589 


27490 


A 


13745 


172 


1 


KNKIEEQMKWEMRQEAVAHSCNPSTIiGG 
QGE FE TS L VNMMKP CL YKKMQ K I S Q VC S 
C 


13590 


27491 


A 


13746 


285 


1045 


FQHPFGLSQSEMAAVKASTSKATRPWYS 
HPVYARYWQHYHQAMAWMQSHHNAYRKA 
VESCFNLPWYLPSALLPQSSYDNEAAYP 
QS F YDHHVAWQD YP CS S SHFRRS GQHPR 
YSSRIQASTKEDQALSKEEEMETESDAE 
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K=Lysine, L=Leucine, M=Methionine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














VECDLSNMEITEELRQYFAETERHREER 
RRQQQLDAERLDSYVNADHDLYCNTRRS 
VEAPTERPGERRQAEMKRLYGDSAAKIQ 
AMEAAVQLSFDKHCDRKQPKYWPVIPLK 
F 


13591 


27492 


A 


13747 


2 


305 


GR VGS FS VRD VE LSD P ARE RGE MP VAVG 
PYGQSQ P S CFDRVKMGFVMGCAVGMAAG 
AL FGT FS CLR I GMRGRE LMGG I GKTMMQ 
SGGTFGTFMAIGMGIRC 


13592 


27493 


A 


13748 


2 


305 


GR VG S F S VR D VE LSD PAR E RGE MP VAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
ALFGTFS CLR IGMRGRELMGG I GKTMMQ 
SGGTFGTFMAIGMGIRC 


13593 


27494 


A 


13750 


238 


423 


AVS WDQLTWGTGVQEKKI QAQAWGLMP V 
IPTLWEGEVGGSPEVKSSRPACPHGKTP 
FLLKMQ 


13594 


27495 


A 


13751 


455 


248 


ISVGPGIFQLNFCFLEPYGTFLSEKKFF 
IMETGSRFVIQAGVQWGSYSSPQPQFFG 
FESSLKPPHSGVK 


13595 


27496 


A 


13752 


2 


98 


I P T P T I HTHTHS HTHTHTHTHT S FL YM P 
PDLK 


13596 


27497 


A 


13753 


196 


2 


IWAPPKIFLKKPPLFFFFFFFFFFFFFF 
FFFFP FF 1 1 S FI PWPLTRKQKLERS WVF 
FKDSACSA 


13597 


2749S 


A 


13754 


917 


379 


KYKKCSLQKNLLLVGCKKYSLCYRRHKL 
HLVTHGERKKPAAVNSFFLFFS FFFFLN 
LVRNTE I TKKRVNLHETKADAE S CNQD C 
TTSSETASELEQIRSGKHNSGKWAGEGA 
AGGRREGERMEWTEMRRGARGRRGGRER 
EMERARVRGGEEREKEIDLYKKVTSKIE 
ETKLGNLLKLG 


13598 


27499 


A 


13755 


175 


21 


KKKNFFFPPRVILGPPKVFFKRAPLFFF 
FFFFFFFFFFFFFLKKSWRPLAI 


13599 


27500 


A 


13757 


234 


341 


E IGWVQWLMPVI P AVWEAE VGGLLE PRS 
LRAARAI 


13600 


27501 


A 


13758 


1 


365 


PAPNRRGGHIQDRATNSTELGANQCFFF 
S PRPPSLEKKTPE INKEPRPAPQPSNPG 
NLGTREGGDSWAGTTRCLRRTDEGDTYR 
TEPPTALSWGQTRAFFFPPALPAGKKRH 
RNLLKTQFF 


13601 


27502 


A 


13759 


87 


181 


SHTHTHTHTHTHTHTF YLHWIHFDME I 
LGL 


13602 


27503 


A 


13760 


1 


228 


ARGERERERERERERERERERERERERE 
RYREGGRLLLTMGEGETERATDL YHTP P 
PSIKAWRLCEPRPRAGILCGRNIF 


13603 


27504 


A 


13761 


383 


197 


RCD P PAWVSQSAR I TGVS YRAQPATSVL 
MGEEDFLEDPVIAFVRLAPAVLLSKLSE 
VSVAMT 


13604 


27505 


A 


13762 


276 


57 


YAVLGGGGGRKNLFVFVFFLFFFFFFFF 
FKKQGFGLMPLGEDRGPNMGFFYFPMVG 
DKTKLPFKKKKKKRPR 


13605 


27506 


A 


13763 


377 


209 


PRPAYGPAFLFTEGFSWNPFFFFQMESR 
SVAQAGVQWCYLGSLQPPPPWFPGTLLN 


13606 


27507 


A 


13764 


160 


202 


MEKYNVHPHSGILHSHEKEQAALFTIAK 
RWRQ P S YP S I DE WKN I MS IHT VE Y YTAM 
KRNKQLYSQ 


13607 


27508 


A 


13765 


390 


284 


ESGGHFLSLSLS I YI YIHI YI YVCVYI Y 
THTSHTI 
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C=Cysteine, D=Aspartic Acid, 
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Q—Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13608 


27509 


A 


13766 


350 


41 


YWILQENRNTQNRKLLPKSTELiCMLLQ 
MLLFILNTTNYPLPLWQCYHRFCSFFLK 
TGS F YVTQAKVRWLFTGM I TVHCSLKLL 
DSSHPPKWLVLQELATVAS 


13609 


27510 


A 


13767 


330 


156 


KITQAWWCVHWPDNREAEVTGVIRLGR 
MRLPLEGACTNCVPSWEKEQDPVSHNEK 
K 


13610 


27511 


A 


13768 


25 


292 


KCFFLSWRGGSRLQSQHFGRPRRADHLR 
PGVPDQPGQHGFLLVKMSDKPDLSEVEK 
FDRS KL KKTNTE E KNTL P S KE I F FS L VG 
VNIQD 


13611 


27512 


A 


13769 


585 


680 


KCLGSRTRWLTPVI PTLWEAEAGGSLRP 
RSS 


13612 


27513 


A 


13770 


488 


359 


PS PRERKFLP FFKKKKGWGP P P P PKKNR 
GRGPQQKEGPQKPKP 


13613 


27514 


A 


13771 


794 


515 


PDMGLEDEQKMLTESGDPEEEEEEEEEL 
VDPLTTVREQCEQLEKCVKARERLELCD 
ERVSSRSHTEEDCTEELFDFLHARDHCV 
AHKLFNNLK 


13614 


27515 


A 


13772 


130 


397 


WGLTPLCFCMRASLPRHSKPQARKTKK 
KKKKKKKKKKKKKGGSLKKKI FGGAKKS 
GATKKKNFPKKRGQNKKNPGFFKNE I FF 
GGGAI 


13615 


27516 


A 


13773 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNK I P LE L FP S NQ T F FH F VRNWKKRGNN 
WLSNYS I S STVTRHFPTLFQF 


13616 


27517 


A 


13774 


1 


247 


GLQS LADDLSGLLQKVCLPLSNG I MDL Y 
LLYYLFTFETGS CSVAQARVQWCEHGSL 
QPHTSAS SDPPTS ATTTCHHARL I IW 


13617 


27518 


A 


13775 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKI PLELFPSNQTFFHFVRNWKKRGNlSr 
WLSNYS IS ST VTRHFPTLFQF 


13618 


27519 


A 


13776 


107 


468 


EKARS PERARDQEGGTERDRNSEREKI L 
PKLREELPWVSGGWRCPWPHQGWSHWED 
MPGWGIPTGPSVGWGEKKGPGEGRSHKY 
GTGRTKCELGVSIGNSAFTFLLHFYFKH 
RKRERKQI 


13619 


27520 


A 


13777 


125 


2 


NIFFPLFELYYFKCHWARWLTPVIPALW 
EAKEGGSPEVRSS 


13620 


27521 


A 


13778 


166 


74 


GRICYSHSLSKHTHTHTHTHTHTSHISF 
IP 


13621 


27522 


A 


13779 


1 


341 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERLS FSLGGGGALKKKRI FLCVTLE 
GKHLP PLTHTGFFGENTHTLSMKTREGG 
FICAGGDERRSASVMRAYIYRECEPHTH 
V 


13622 


27523 


A 


13780 


'614 


337 


RRCSSALCYRRHGNHKVKIRS KQGASVS 
PHEHLRI LSLLEVRNVGLGHS I FPHSLR 
M YVCM YVCM YVC I S I YL S I YL S I YL S I Y 
LSISISSR 


13623 


27524 


A 


13781 


390 


158 


VL I RL S WGR I C FQAHS GCWQNS GS C SHK 
TEGLCLLLAGSCSWLLGGALKGWASKEE 
FYTMQHSCQSDTLSPLPYSIH 


13624 


27525 


A 


13782 


308 


163 


TKFYLLFFFFFFFLFIFFFFFFFLKKKF 
LFFIQGGGKIKTPGTLTFLG 


13625 


27526 


A 


13783 


45 


277 


1ASGRPFFFFLHLPPFQKAFVFGGGGGT 
PFEYQNFVAYIKGKQGNPFLFGCGDLFN 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=PhenyIalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=GIycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possibIe 
nucleotide deletion, \=possib!e 
nucleotide insertion 














APQL I KQLSQLGQKYPKKPLL 


13626 


27527 


A 


13784 


413 


218 


PNFPPFLFFSGKFFFFAQNLPFGFFPPP 
GEGKAFPGQGCPVGGGGFAGPTFFFGPP 
AFPEILYFF 


13627 


27528 


A 


13785 


1011 


1368 


I DWENDLTPMVLCHGPS I YNKYS FSQNI 
FPISANFFFIAHINLTLLQLVSELSHKI 
I LRHF Y I KP I KD I LNLYL I L PGVWLFDL 
CKSTAICRYLCVCIYIHLYIHTHTHIHI 
YTHIRL 


13628 


27529 


A 


13786 


161 


425 


RWGS C YCPGYPHL3CI FKLLLLLVLVLEM 
GVLPLLPRLECSGMITVHCNLEFLGTSY 
PPISVSQTKKKKRPFTLPPLPNLQIGLP 
LRGE 


13629 


27530 


A 


13787 


70 


311 


DLISITALVISPNKFYIYLSIYLSIYLS 
I YLS I I YL I F I S TNLL WVLS L WETLNKT 
RLLNLFLTIQNNIWNPKLLVSALI 


13630 


27531 


A 


13788 


25 


219 


RNMAAATLTSKLYSLLFRRTSTFALTI I 
VGVMFFERAFDQGADAIYDHVNEGKLWK 
HIKHKYENK 


13631 


27532 


A 


13789 


279 


138 


RRSIiALSPRWDCGLQWRNLGSLQALLPG 
FTPFSCLSDPSSWDYRPF 


13632 


27533 


A 


13790 


138 


28 


GGWVRWLTPVI PALWEARVGGLLEPGSS 
KKERGKKEV 


13633 


27534 


A 


13791 


133 


7 


YSCQRCIiTHGQAQWLiTTVISAPWEAKAG 
RLLE PRS VRSAWAT 


13634 


27535 


A 


13792 


86 


290 


EHVHRTLIEASVSGWGWWLTPVISAAWE 
AEVGGSLEGQEFEISLGKCSETPSLTTT 
TTESLGSFAATS 


13635 


27536 


A 


13793 


150 


37 


KI I YPSQARWLMPVI PALCEAEVGRLLE 
PSSSRLAWAV 


13636 


27537 


A 


13794 


216 


77 


PTQHPTEGYMIKQQTVYHECRMWANSHL 
CLLPEGLLRAVTPWCHAP 


13637 


27538 


A 


13795 


251 


347 


VARAWGLAPVI PALWEAE VGGS PEGRS L 
KPAL 


13638 


27539 


A 


13797 


177 


274 


VQWHMS WPAI QEAE VGGS PE PRSLRPA 
WATY 


13639 


27540 


A 


13798 


307 


406 


KFKKGWAWWLMPVI PALWEAKAGGS PE S 
RSLRP 


13640 


27541 


A 


13799 


194 


3 


FFFFFFFFFEMGSHYIVQARVQWLFTDA 
NIVHCSLQLLASSDPPVSTSQVGLQACA 
DDAQNPE 


13641 


27542 


A 


13800 


333 


198 


LPSVFFFFFQTESCSVTQAGVEWCDLGS 
LiHS SLGNRARLCLKIK 


13642 


27543 


A 


13801 


1 


335 


ETERGRERETERGRERETERGRERETER 
GRERERERERERERERDREEGRERQRGE 
GKKNRDREENRRRETERVGDWCVCPHP 
NL I LNCS P Y I SHVLREGLGKT YLNYGGS 
FPTLCSG 


13643 


27544 


A 


13803 


14 


454 


RVFFDRSRYRSRTLGSTHASALLGILVY 
RSHLI S SLLCLEGI ILSLFI IATLITLN 
THS L LAN I VP I A I L VFAACE AAVGLAL L 
VS KKKKKGAP VL KN ? WGAQ S LRGQARKY 
FFPYREPNKNLPGNLGKEPFLGGGDILG 
QPPYKN 


13644 


27545 


A 


13804 


1474 


1661 


TDFHNSHLLKCCKNCIFSTLNYYIRREH 












F S I VF I F C Y I CWKVHY I RRELFRCL Y F 
LLYMCC 


13645 


27546 


A 


13805 


360 


3 


LWGKRGGPFSPPPFFFFLELFFFLKKMG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GGGFFGGFPWGPPKGEIFFFVFFWAPQK 
KKKKKPPPFIFWGGGPFFFFFFPPPKQK 

KKKNKKKKKKKQKKKKKKKKKRAARWPA 
RSRSSA 


13646 


27547 


A 


13806 


1 


425 


RLGGVALRSAADGAFVSGEFCGGKLLRW 
CLVTDFPPDSCTTCSYSRRSTPGCSPGG 
S RGL S E GEG S S VS LQR S R VL S AMKHVLN 
LYLLGWLTLLS I FVRVMESLEGLLESP 
SPGTSWTTRSQLANTEPTKGLPDHPSRS 
M 


13647 


27548 


A 


13807 


5 


313 


EKPYLQGITFTSKTTHLIYEFTPYTIIV 
KE YS TN Y VL L I GNTQ I TK I YH I VFRKF S 
MTP I YKFQDLAQWLTPVI PTLWEAKPGG 
LLEPRSLQSSYSLIILIL 


13648 


27549 


A 


13808 


182 


314 


V I TQLD KTE RAQ WLMP V I L VL YE AE VGG 
LLEAMSSRPAWATKTW 


13649 


27550 


A 


13809 


219 


1812 


LPPPESGAMSGFNFGGTGAPTGGFTFGT 
AKTATTTPATGF S F STSGTGGFNFGAP F 
QPATSTPSTGLFSIiATQTPATQTTGFTF 
GTATLASGGTGFFLGIGASKLNLSNTAA 
TPAMANPSGFGLGSSNLTNAI SSTVTSS 
QGTAPTGFVFGPS TTSVAPATTSGGFS F 
TGGS TAQ P S G FNI GS AGISTS AQ P TAP ATL 
PLT P AT PAATTAGATQ P AAP T P TAT ITS 
TG P S L FAS I AT APT S SAT TGLS LCT P VT 
TAGAPTAGTQGFSLKAPGAASGTSTTTS 
TAATATATTTTSS STTGFALNLKPLAPA 
GI P SNTAAAVTAP PGPGAAAGAAAS SAM 
TYAQLESLINKWSLELEDQERHFLQQAT 
Q VNAWDRTL I ENGE K ITS LHRE VE KVKL 
DQKRLDQELDFILSQQKELEDLLSPLEE 
LVKEQSGTIYLQHADEEREKTYKLAENI 
DAQLKRMAQDLKDI lEHLNTSGAPADTS 
DPLQQICKI LNAHMDS LQW I D QNS ALLQ 
RKVEEVTKVCEGRRKEQERSFRITFD 


13650 


27551 


A 


13810 


134 


263 


KNSLFKKKNNGRPWPLMPVI PALWEAEA 
GGLLE PRGLRPTWVT 


13651 


27552 


A 


13811 


210 


51 


TLSHRKP I S FE AYFARVRGF I LEVSETR 
NPPIGWTRWLTPVIPALWEAEAGG 


13652 


27553 


A 


13812 


241 


139 


LHEAGLAPMI PALWEVKVGGLLEPRS PR 
PAWASW 


13653- 


27554 


A 


13813 


539 


256 


RTTQMSTAAGFTEAPNWKQPRCPWTEEQ 
TNKMWS LHAME CAS AMKWNE VL I Q P AVR 
MARENSRRKPGDMHDIWSVWSAGEWLPG 
LGRSTGKGSD 


13654 


27555 


A 


13815 


63 


369 


VRETPLKTHYLEEISSPASPTAIPQSLL 
FS FFISPPSS LATGS GHS GH P VHS LHH P 
PETE P S PS VCLWAGPKVPPGAAGKGS PD 
SNPLVIRSLLAPPPASL 


13655 


27556 


A 


13817 


258. 


3 


AE SAP P F S TNS L F FTHTHTHTHTHTHTH 
THSLRTTRWAPSQVCRPQTSQRGGECVG 
PTAFAP S PTLLKPHHPS S HVHL P S Q PRR 


13656 


27557 


A 


13818 


516 


665 


WFKS GS F WLGMVAHTCNP S TLGGQGGW I 
TSGRSRTSVTTTTTSQTCAPA 


13657 


27558 


A 


13819 


295 


361 


WLTPVI PALLEAVTGGSLEPRS 


13658 


27559 


A 


13820 


166 


368 


G WWAT QHS AQ YWS S S S I GAL CLGE S C S 
VTQAGVQWCDLSSPQPSHPGFKRFLCLS 
LPSSWDHRGDL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26' 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


13659 


27560 


A 


13821 


133 


530 


RKYCGQCLNMALNNVSLS SGDQRSRVAY 
RS S HGDLRPRAS ALAMVS GDGFLVS R P E 
AIHLGPRQAVRP S VRAE S RRVDGGGRS P 
REPDGRGRSRQARFSPYP I PAVEPDLLR 
S VLQQRL I ALGGVI AARI S V 


13660 


27561 


A 


13822 


76 


1958 


RQEL I WPLCS PPQGDRFLQKS WI FFRP V 
MADKLTRIAIVNHDKCKPKKCRQECKKS 
CPWRMGKLCIEVTPQSKIAWISETLCI 
GCG I C I KKC P FGALj S I VNL P SNLE KE TT 
HRYCANAFKLHRLP IPRPGEVLGLVGTN 
G IGKS TALKI LAGKQKPNLGKYDDP PDW 
QEILTYFRGSELQNYFTKILEDDLKAII 
KPQYVDQIPKAAKGTVGS ILDRKDETKT 
QAIVCQQLDLTHLKERNVEDLSGGELQR 
FACAWC I QKAD I FMFDE P S S YLDVKQR 
LKAAITTRSLINPDRYI I WEHDLSVLD 
YLSDFICCLYGVPSAYGVVTMPFSVREG 
INI FLDGYVPTENLRFRD ASLVFKVAET 
ANE E E VKKMCM Y K Y PGM KKKMGE FE LAI 
VAGEFTDSEIMVMLGENGTGKTTFIRML 
AGRLKPDEGGE VP VLNVS YKPQ KI S P KS 
TGSVRQLiLHEKIRDAYTHPQFVTDVMKP 
LQIENI IDQEVQTLSGGELQRVALALCL 
GKP ADV YL IDE P S AYLDS EQRLMAAR W 
KRFILHAKKTAFWEHDFIMATYLADRV 
I VFDGVP S KNT VANS PQTLLAGMNKFLS 
QLE I T FRRD PNN YR P R I NKLNS I KD VE Q 
KKSGNYFFLDD 


13661 


27562 


A 


13823 


3 


292 


KCWDYVHEPPRPAPMQTYQVDLRCCFVS 
TINLLVCGERASLPSVESLVFSGLAEVW 
MKAVLAP S RE S GKL VLQQ VP E YAE AI WG 
ARPACRGL.VFSA 


13662 


27563 


A 


13824 


245 


91 


YKVNFGPPRGSFFRPPPFFFFEVFHFFF 
IIFFFFFFFFFFFSFFLGCFMYV 


13663 


27564 


A 


13825 


276 


100 


PGQKKKTLS PKKKKS I YL Y I HTHIY I YT 
HTHYI YKYIFVCVYICVS IYIHTHIVPR 
IFS 


13664 


27565 


A 


13826 


202 


375 


IYNTYIHTHTHTHTHTHNVYLYIQSGDC 
IYAVLLGELSFIYIFVFLSSLISILFHL 
TY 


13665 


27566 


A 


13827 


278 


34 


KPRGLS PAGVSAQRAWGPHS CPRLPDPR 
TELQVRQ I PALVGS S AFQGRRGRKGPVG 
GSGPSRPRAPPQPRPGCCQCTLLQI 


13666 


27567 


A 


13828 


62 


307 


GGARAI FLNVPHLKRPMFFLELGLSTKC 
PVGTEFALDSLLRLYNSNTFCWVQWLMP 
TI PALCKAEMGGLLEARS L I PAWAT 


13667 


27568 


A 


13829 


386 


478 


NRIGMVSHACNPSTLGGQGRRIMRSGDQ 
DHS 


13668 


27569 


A 


13830 


364 


484 


NFNI WLGWAW WLM P VI PAL WE AKAGGS L 
EVRGSGPAANMV 


13669 


27570 


A 


13831 


33 


486 


PARSAEFGTRRERERERERERERERERE 
RERERERE RE RE RE RERERE RE RERERG 
VICVCPCALLCVRESDPLSQKKYCVCVF 
WRYTESLALSPFCLGRCLALLIFLEGM 
WRKIWAPSFLSLCVGALFFSQHPPPFWC 
LCVLSLVRPLPLSLSGGVFVSQRALI 


13670 


27571 


A 


13832 


6 


718 


YSAVE FAMAGVGAGPLRAMGRQALLLLA 
LCATGAQGLYFHIGETEKRCFIEEIPDE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, LHLeucine, M^Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V— Valine, 
W=Tryptophan 7 Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TMVIGNYRTQMWDKQKEVFLPSTPGLGM 
HVE VKDPDGKMLQ WLSRQYGS EGRFTF 
TSHTPGDHQICLHSNSTRMALFAGGKLR 
VHLD I Q VGEHANN YPE I AAKDKLTELQL 
RARQLLDQVEQIQKEQDYQRYREERFRL 
TSESTNQRVLWWS I AQTVILILTGIWQM 
RHLKS FFEAKKLV 


13671 


27572 


A 


13833 


17 


130 


RLQEFGTRRERERERERERERERERERE 
RDTHSFLHG 


13672 


27573 


A 


13834 


283 


460 


LWRLAI CKNYSLKRETEPGS VAHVYNP 
S TLGGQGGRTARGQE FKTS LDN I ARP PS 
LQK 


13673 


27574 


A 


13835 


3 


435 


WPRFCTALQEFGTRRERERERERERERE 
RERERERERERERERERATSLSRARPSF 
LFPSACVSHTPYVGERENLSLHT-LKHTH 
TREKHSLSYTNASARDTRARFSEGRAPP 
IYSVYLTHGVFFFFFFSLCLSVRHAEGL, 
AHV 


13674 


27575 


A 


13837 


351 


57 


RTLVHISKFTGNLSDLYYVQSEWQDLGT 
ADLSIMTSVLHCLPSLESIKNTDSWPSS 
VAHTCNPSTLGGRGRCRDQHGQHSKQPS 
LLiKI QKI ARRCGG 


13675 


27576 


A 


13838 


336 


150 


TFCYKKYLWGAHCFIFRTTGLP IKNFYF 
GQAWCLL P VI S ARWE AETGGS LE PRS LR 
VAWAS 


13676 


27577 


A 


13839 


310 


454 


HCSLGLPQVLLI ILYKILLGWARWLTPV 
I PALWEVEAGGLIiEARSSRP 


13677 


27578 


A 


13840 


163 


297 


I I FFLLDLKLQGAMTIYVS IYLSIYLS I 
YLS I YLYTHTLWNTLQL 


13678 


27579 


A 


13841 


33 


316 


LDQHPTPRSPLLCHSLRKTSSSQGGKSE 
LVKQSLKKPKLPEGRFDAPEDSHLEKEP 
LEKF P DD VNP VT KE KGGPRG PEP TR YGD 
WERKGRC I DF 


13679 


27580 


A 


13842 


4474 


2586 


DGG S GC VKME F P GGNDN YL T I TGP SH P F 
LSGAETFHTPSLGDEEFEIPPISLDSDP 
S LAVS DWGHFDDIiADPSS S QDGSFSAQ 
YGVQTLDMPVGMTHGLMEQGGGLLSGGL 
TMDLDHS I GTQ YS ANP P VT I D VP MTDMT 
S GLMGHS QLTT I DQ S ELS S QLGLS LGGG 
TILPPAQSPEDRLSTTPSPTSSLHEDGV 
E DFRRQL P S Q KT WVE AGKKQ KAP KKRK 
KKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVSKIVASMWDSLGEEQKQIY 
KRKTEAAKKEYLKALAAYKDNQECQATV 
ETVELDPAPPSQTPSPPPMATVDPASPA 
PASIEPPALSPSIWNSTLSSYVANQAS 
SGAGGQPNITKLIITKQMLPSSITMSQG 
GMVT VI PATWT S RGLQLGQ T S TAT I Q P 
S QQAQ I VTRSVLQAAAAAAAAAS MQLPP 
PRLQPPPLQQMPQPPTQQQVTILQQPPP 
LQAMQQPPPQKVRINLQQQPPPLQIKSV 
PLPTLKMQTTLVP P TVESS PERPMNNS P 
EAHTVEAPSPETI CEMITDWPEVESPS 
QMDVELVSGSPVALSPQPRCVRSGCENP 
P I VS KDWDNEYCSNE C WKHCRDVFLAW 
VASRNSNTWFVK 


13680 


27581 


A 


13843 


54 


251 


EFYRMNSPSPLWLCVCVCVCACLFMCLC 
ALT CMC VW YKS L LHS NLTQ VL S S KF YDS 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISPAENPGK 


13681 


27582 


A 


13844 


154 


26 


DPHVPPTERSEGSQARWLPPVIPTLWEA 
EACGSPEVKSSNST 


13682 


27583 


A 


13845 


475 


225 


GHTELYGGALFQLLQAAIPEGGGTSCLF 
NPSSDGGS S AAGG WGGAGDLAR S SRVP 
SPTSSRPVPSPDPSPPSLYLTPWKETH 


13683 


27584 


A 


13846 


479 


319 


RDCRRS LTHSVLSGAQAGVQWRDLGSLQ 
PPPPSRLPWPPKAPRWQPLPGHHPI 


13684 


27585 


A 


13847 


182 


499 


LLCVKLCDS CNENKALREQTRAQVQGGH 
SRAQVQQVQGGHSAHTWQ IRSAASRRK 
AFSTCSSHLGMVLLFYGTGSSTYMRPTT 
RYS PLEGRLAAVF YS I LI PTLN 


13685 


27586 


A 


13848 


388 


494 


RDSWERWLEPLI PVLWEAQAGRLLEPRS 
LRLAWAT 


13686 


27587 


A 


13849 


198 


63 


HHATTWMNLEDIMFRKVSQTQKNKYYM I 
PLIWKTKKVLSHREQK 


13687 


27588 


A 


13850 


555 


353 


RCPPGAQAPIiPAPSPAPPVHIALSPLSQ 
GVSWPRELTAHPPEPPPPPFFKIHPHPQ 
LSVGKRGQMTP 


13688 


27589 


A 


13851 


169 


66 


LLAGSTGQARWLTPVI PELWKAEVGGSL 
EIRSSR 


13689 


27590 


A 


13853 


12 


105 


IASGLHDFFKKKKI KKKKKKKKKKKKGG 
AL 


13690 


27591 


A 


13854 


256 


363 


ICLFIKDSDRGQTQWIiTPVILTLWKAEA 
GGSLEPKS 


13691 


27592 


A 


13856 


23 


436 


IDRLSDSHEERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERESSLLCVRHTCFAPPTYIFLWETLR 
VCPTPCVWGALSVFSHTERARI FFYTCG 
GS VARAPVCAHTRI FSSRARARALCARP 
PLSRERRAHTRVFFI 


13692 


27593 


A 


13857 


91 


428 


PDPDTSPLTTDRGETPLGATLPS CYCGG 
QQSFEPKTKKTTNQKKKKKGKKFTPQKK 
NP K I F P P KP FC P KG P RGENQ P KKTQNQ P 
KKKKKI LGGGPAPNS PLGGGKKDTS WAGF 


13693 


27594 


A 


13859 


465 


43 


KRVTTENPQHICSAETAIPWSLSQNCNF 
HILWWGHPNHPIYITTSIYIYIYIPHIY 
GMVY I YTR I CMC VC I YVHI C I HTHTHI Y 
I Y IHTHTH I YMERERET VS CS VAQAGVQ 
WCDLGSLQPPNKRLKRFSCLSLPSSWDY 
W 


13694 


27595 


A 


13860 


290 


52 


W YKYLPGTRGFL I WLNYLKC I GR I ELL V 
GYFSFIYLSIYLSIYLSIYLSSRSIAQP 
GVKRCNHSSLQPRTPGLMHLSNS 


13695 


27596 


A 


13861 


108 


22 


I FFLFSAVKTRFS YAFPKEFP YRMNHVS 
I 


13696 


27597 


A 


13862 


167 


33 


VFTVPGTYMCVSVRDLGQAWWLTPVIPA 
LWEAE AGGS LEE FLRT 


13697 


27598 


A 


13863 


448 


355 


FFFFFFFFFFFIFFFFFYFFFFFFFAKN 
PLM 


13698 


27599 


A 


13864 


321 


445 


SQSTQAWCLMPVIPALWETKSSESFEPR 
SLRPAWATGHNPI 


13699 


27600 


A 


13865 


380 


80 


MILSPSPKFPSFLFSLFSALLLWHCLEL 
PFLSNVGKGMQNDPVGLSPTSTPSDLSR 
IYLFFCFFVCLFVCFETGFPSVAQAGVQ 
WCDPLGLKWFLPPQS P 


13700 


27601 


A 


13866 


372 


264 


KMENPEDKNFFIFFMFFIVFLLFFFFFL 
FCLTQFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginme, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, y=Tyrosine, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13701 


27602 


A 


13867 


161 


304 


LFIAEMTSHYIAQAGLELLASSNPPTSA 
S ERTG I TDGATE I LPQLRL 


13702 


27603 


A 


13868 


230 


348 


VSFI QNDVF CVERVYRVPDFGVWERGS K 
YNNGSTELHSK 


13703 


27604 


A 


13869 


168 


260 


EVIAECYNGCFQTLSLCVCVCVCVCVCV 
CY 


13704 


27605 


A 


13870 


140 


16 


LKYGQWRDLGSLQPPPPRFKQFSCLSLL 
SSRNCTTMTRDHE 


13705 


27606 


A 


13871 


143 


30 


KILKVWSQAWWLIPVIPALWEAEVGGS 
LEPRISRIiA 


13706 


27607 


A 


13872 


158 


59 


HLHGCIHLLKLRIiDAVSHACNSSTLGGQ 
GGRIA 


13707 


27608 


A 


13873 


232 


115 


APFFFKGAPIFFFFFFFFFFFIFFFLIR 
GKPGNDVGVEG 


13708 


27609 


A 


13874 


1619 


1375 


KTT WRRHLE KLQDMS LLRHR VHVDS S QG 
LFPFFQRQGLLPRLKCNGDI I PYCNLKL 
LGWSNS PAS AFR VART TS L CRHT WL 


13709 


27610 


A 


13875 


219 


84 


NPLQSTSTICNRAWDRVTCLWSQLLRRL 
RWEDHLSPGVPSCSAL 


13710 


27611 


A 


13876 


166 


496 


KKKKKKKKKGGGPLKKTPGGAKINRGRK 
KKIFFQKGGQKKTPGGILKKKPFLGGGK 
KGPNPPKKIKGLREKKKFKRGKGAKPAQ 
NPWGKKIS P PGFFLKKFFPRGRGFF I 


13711 


27612 


A 


13877 


132 


348 


P S KKKKRGG P F KRTPGGP KFNGGGKGNS 
FPLMGGGIKPNLGILGKNPYLGGGTNGN 
NP P TE I KGFGE KKKF 


13712 


27613 


A 


13878 


287 


134 


QQGP PIS PGP KEENRMGP PS PG WGNPG I 
RS P PGAGPNPGAFQDGMFPS KKQ 


13713 


27614 


A 


13879 


333 


480 


VHPLRSAEGPRPESNMLHLTLKSKQQWL 
TPVI PALWEAKAGGSQGQE IE 


13714 


27615 


A 


13880 


1 


227 


PKIHCKIQHIWLVSFFPSFFIjSFFQGE 
SCSAAQAGVQWHDLGSLQAPPPGFKWFR 
FIELLGLGFHSRRMRGKDS 


13715 


27616 


A 


13881 


276 


378 


GWPWWLTPVIPALLEAEVGGLLEPRSLR 
LALAT 


13716 


27617 


A 


13882 


126 


389 


GS VLGTGCCGS VLGTVCRGS VLGTVL I A 
SATPDRYLRLSSGSSLTRITNSDLVPWL 
TPVI PALWEAEAGVS LE PR I WR PARE TW 
EDPH 


13717 


27618 


A 


13883 


414 


150 


ARQAPKWGNPTGSPPPGFLNPPPKNFFL 
GPQKKKIFFPTPPPPHFFFFKGPPPFFF 
FFFFFFFSPHASGILjLCHPGRIAVAQSW 
LTE 


13718 


27619 


A 


13884 


197 


363 


DLGWWNTFMLKFKCLWSRGHVQWL I 
IPVIPILWKAEAGRSLKPRSLRPAWAT 


13719 


27620 


A 


13885 


101 


236 


NIHYANFPYFGIiACPEYIVRALQKKYTQ 
THTHTHTHTHTRTTHSL 


13720 


27621 


A 


13887 


17 


140 


TVHLKMVKIGLHMVAHACMPSTLGGRGG 
PIMRSGALGNFCL 


13721 


27622 


A 


13888 


135 


1 


VLHAGLKLLGSSTS PTSASKGAGTVGIH 
YHTQIAFQFFLFCFLiIR 


13722 


27623 


A 


13889 


411 


276 


TLYFKHGEMGFHHVGQAGLELLASSDPP 
ALEFETSLHNKVRPRL 


13723 


27624 


A 


13890 


216 


54 


EFKLCYTLFDHSYFPLEELVSGQVWRLR 
PAI PS FWEAKEGGS LEPRS S KHE IV 


13724 


27625 


A 


13891 


3 


149 


SSDPRPSSSWLAWVGLWSMVWRVPPFLL 
PILFLASHVGKSPLSPSPHF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine } R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


13725 


27626 


A 


13892 


166 


1 


LFSKIFWWLLPVIPALREAKVGELLEPG 
SSRPAWATWKNPVYTKNTKISQAWQCT 


13726 


27627 


A 


13893 


128 


2 


FSTTYPNP I KLPHPYLPLLTLFLDSAHL 
HPS E INSLVAQKKK 


13727 


27628 


A 


13894 


225 


391 


HLSIHILTFPFSLPEQWWLVPIIPVLWE 
VKMGGDLEPRSSRPAWATQQDPQVSRK 


13728 


27629 


A 


13895 


192 


1 


TLVCVSSSVPKNYNNICLACWFMPIIPA 
LWEVESGSLLELKSSPPAWPTWWVPLKS 
KNTKISLA 


13729 


27630 


A 


13896 


152 


. 245 


DQS IADVLMYYTHTHTHTHTHTHTLQNQ 
VPG 


13730 


27631 


A 


13897 


270 


406 


ISTRFFLYRDRASLCHPGWSAGVQSWLP 
GSSNSQAQAILLPQPPE 


13731 


27632 


A 


13898 


232 


73 


E AEGP RD VT S S T FHWRL YD Q TTN YHE CR 
MWANSRLRIiTPEGLLKVITPWCHAH 


13732 


27633 


A 


13899 


107 


7 


LGQAQWLMP 1 1 PAIiWEAKAGGS LEPRTS 
RPAWA 


13733 


27634 


A 


13900 


62 


224 


PTRPILAHCNLCVLGSDDPPASASQSAG 
VTGMSHCAWSNLI IHLPSRSTAICTL 


13734 


27635 


A 


13901 


450 


194 


KRQVKSHS E VKTTHVQ WPG IDPGS PAWE 
ARILPLNHQRLSGTTPWRITEKSNHKDL 
ETVLSGFFKCRLKANKDIQTKCVFIGNF 
Y 


13735 


27636 


A 


13902 


300 


404 


AAPGRAPQKKKKKKKKKKKKKKKKKKAP 
FIKRGR 


13736 


27637 


A 


13903 


131 


29 


ALGSLQPLSPRFKRFSCLSLPSSWNSTA 
LTVTTM 


13737 


27638 


A 


13904 


267 


74 


VWVLVPPFTSYGVLDKPILIVNNIYVHV 
CVCLCMCVCVCVCVCCTVIRRHCWHNMI 
IIISQHDQ 


13738 


27639 


A 


13905 


193 


64 


IRSHQIHKLHFKKFRILGWTWWLTPVI P 
APWEAEEGRSPEPSS 


13739 


27640 


A 


13906 


309 


424 


QSTDFLENRRVGWAWWLTPVI PALWEAE 
VGGLPEIKNS 


13740 


27641 


A 


13907 


404 


251 


SCLISLRSWDYRHVPPQPANFVFFFFEM 
ES CS VAQSGVQWQEENS I S KINK 


13741 


27642 


A 


13908 


471 


59 


GPPQAKKGKGFPTPGPPQKKGGGQPKSG 
VLGFSPPPLGPCPGLGPPKGWGFWGGSF 
FFFFFFRKWEHKNELFPFSKTEKPRRAG 
EARKGSQSTKYAGLSRGERQASPTLRPG 
HLAKASAGGWLQYTRWQGKLPEPGN 


13742 


27643 


A 


13909 


479 


231 


QEYCMLIFCEHFGDFDLNIVFCLEKKMV 
GVIADFYLLSSFTGRPQWLMPVIPTLWE 
AEVGGSLEAKSFTPAWATWQDPISMF 


13743 


27644 


A 


13910 


167 


19 


CIVLGIMLDSLWEPRQWVPSWVRWFTPV 
IPALWEAEAGGSPEVRSSNSS 


13744 


27645 


A 


13911 


328 


446 


GSFSKITDLSQTWWYVPWIiALQESEVG 
RSLEPRSSRLQ 


13745 


27646 


A 


13912 


145 


460 


NAFMSTGCEVLSYSELRQRNLVWWFLPV 
I PATPQAEAGRLLGPRS SRP P WATKGVR 
ALSHKQTNTKTNKIjSHSTGVS FVFTHHI 
HSPGIDWRLSQWLLAMNLATN 


13746 


27647 


A 


13913 


319 


210 


KLGFFKGPPLFFFFFFFFFYFFFFFGFL 
TVS CNF TL 


13747 


27648 


A 


13914 


372 


10 


PNTTCTP I T YGKKCCPRE I S Y YLCHE F I 
R I R I LTFFDGE S C YI AQAGLKLLGS RS P 
PTSASGVAGNTDVCHHTQPILFYILETG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, BNHistidine, I~Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, -—Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LALLPRLILNSWSQAILLPPQPLRSYFN 
FWLFFCLYN 


13748 


27649 


A 


13915 


594 


471 


PSYIAYMCICMCVCVYVYINICVCIYTH 
VHTPYNPVISKN 


13749 


27650 


A 


13916 


235 


3 


EANNLIRKTYTSGWFQRTSKRPPAVRRM 
KVL FFETESCS VAP VGVE WHDH S LL Q PR 
PPGLNRSSHLSFPSSSDYRWA 


13750 


27651 


A 


13917 


272 


184 


K.TT AW AQWLT P V I S TLWE AKVGGSLjE AG 
S 


13751 


27652 


A 


13918 


46 


321 


S RVQGTGGHRGRG C I QAG VGE QEAKRE T 
EGP PWRWLTP VI S KLWEAKAGGS hE P VS 
SRRATWNLLKR I Y YTDERE YL I PLALNT 
SQGSTFSF 


13752 


27653 


A 


13919 


387 


484 


FIE I IGWARWLMPVIPTLWEAEAGGS PE 
VRSS 


13753 


27654 


A 


13920 


142 


321 


FLFSIHFHLSCWGLFSNVWQSLSIFSYL 
KVRLYGQAWWFTPI ILALWEDKAGGSLD 
PRSL 


13754 


27655 


A 


13921 


270 


412 


IEDKKIYSILHTETLERGQARWLTPVIP 
AFWEVKAGGSFEPRSLRPV 


13755 


27656 


A 


13922 


304 


396 


NFGWAQWLTPVIPAL.WESEAGGSLEPRG 
LRS 


13756 


27657 


A 


13923 


305 


387 


VWWLTPVIPALWEAEAGGSPEPRSSRP 


13757 


27658 


A 


13924 


103 


3 


NKS P E LGWAWWLAP V I P AL WE AE AGGS P 
EVRSS 


13758 


27659 


A 


13925 


195 


400 


CLLILSGLVASCI IERHGKGQAWWLMPV 
IPTLWEAKVGGSLELRNLRSAWGTWRNS 
ISIKYTEQLAQG 


13759 


27660 


A 


13926 


294 


489 


KWAKMQTLHKGR YAGVC I C VCVCVC VYT 
YTD I Y I FYIHI WPMNTKCSMLLVIGEMK 
ILSRMWSNQ 


13760 


27661 


A 


13927 


363 


2 


NWGPPGFPPPPFLKTGPVFFIFGAPKKK 
FFLSTPRALKFVLLKGGPLFFFFFFLWV 
FLVKTGSHCVAQPDLELLSSSNPVLVSQ 
GAGITGMSPQVQPLDLFFFFSFFETGSR 
SYAWADAW 


13761 


27662 


A 


13928 


1 


218 


LRFKKRKTNKPKMWSVHTMEYHSALKRK 
EIRTQTTRWINLEDIVLSEIRVAVTKGR 
ILYDSTQMRNLEESNL 


13762 


27663 


A 


13929 


219 


408 


TFCHFI AFLHVYKMF WFFFS F IMFFFP 
LNCNKAFERKKKKNKKNNKKKKKRKKI S 
liSPPAPP 


13763 


27664 


A 


13930 


161 


22 


WHP PASASHVAGTTGACHHERLRQEDHF 
SLRVQGCSEPRLCHHTPP 


13764 


27665 


A 


13931 


116 


1 


INTGWARWLTPVI PAFWEAAVGGSLEPR 
STSQAWATWK 


13765 


27666 


A 


13932 


181 


38 


PWPQRATQPKI S PLPPAI SALWEAEAGG 
ALE PRS SRPAWAKSDAWVD 


13766 


27667 


A 


13933 


267 


397 


TVFDRTLFFIKKI IHSWAPWLTPWPAL 
WEAEVCGSPEVRSSG 


13767 


27668 


A 


13934 


129 


12 


NIGVGWALWLTS VI PALWEAEAGGPPKV 
RSSRTARPMS 


13768 


27669 


A 


13935 


179 


1 


KT YYGG I PPPGPPKKGEKKP P FFFKKFF 
LFFISFFI FFFFFFLRQSLAKMQSAS VW 
KVS 


13769 


27670 


A 


13936 


245 


616 


DDKKKKEAAQKKATEQKIKVPEQIKPSV 
S QPQPANSNNGTS TATSTKNNAKRATAN 



1383 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO; of 
nucleotide 
sequence 


SEQ ID 
NO; of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possibIe 
nucleotide insertion 














NQQPQQQQQQQQPQQQQPQQQPQPQPQQ 
QQPQQQPQALPRYPREVPPRFRHQEHKQ 
LLKRGQHFPVEF 


13770 


27671 


A 


13937 


121 


292 


NTDLDLALAL I VRVLT S DDN FKNWG W VQ 
WLM P VFPAP WE AE VGGS S E ATNLR P AWA 
T 


13771 


27672 


A 


13938 


421 


202 


GPREAPLFPSTQGISRPLFFFFFFETES 
RTVTRAAAQWHNLGSPQPPLCSLDILKT 
SQS VCMPEAAILVSQ I T 


13772 


27673 


A 


13939 


205 


543 


I LE SHRRLG WS E ALP ENQLL CRAS S WES 
LGPPDPPCLKPPSPAFWGAAGGWRAKST 
SVSEPVPVGTLLVALFLEMAVGLLLGLP 
FCFHSLGAPHLLCEPDGPSVLPAEARQS 
R 


13773 


27674 


A 


13940 


140 


2 


LSQHSEEYTLPPPLYLEKESSRPPPPQH 
TPHPPPPWPWHPDIADIQ 


13774 


27675 


A 


13941 


47 


303 


GLI FLI I WTCIVKTSTDFPRMEDCSQCI 
HQVTEESNKRMGFLSYYANPHHGSSRLL 
WPQHAAPWDDGRRGKPVFSLGFVSFPFP 
Q 


13775 


27676 


A 


13942 


L48S 


1719 


PLVS FS KMTYPCLGMVKQNQKTTFCSRW 
KLHVLS SNLS S PAE VTWAS INLWSEQ 
SFQNSQLPAMLCLIVHLQMS 


13776 


27677 


A 


13943 


127 


402 


QPVTHPPNRTMCSSMIFLFLYLCFLSLS 
PFPSHLSLYAPPKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKAGGGGP FK 
KKIHFPPGGGRI IFFY 


13777 


27678 


A 


13944 


852 


209 


EQTVYCTRIDLQQKLVFLRLEAFCLLSQ 
GDPGSPMMCQLQQFDLWVLRGVLNFGGE 
TCPGLFLYTKVEDYS KW I TS KAERAGPP 
LS S LHHWEKL I S FSHHGPNATMTQKT YS 
DSELGHVGSYLQGQRRTITHSRLGNSSR 
DS LD VRE KDVKE S GRS P E AS VQ P LY YD Y 
YGGEVGEGRI FAGQNRLYQPEE I ILGSL 
RACFLFAAVSSPGATPPN 


13778 


27679 


A 


13945 


68 


260 


INIFWHCVLLYVILYAISQFLLFPCVKN 
IASWLGSVAHACNPSTLGGQAGESLEPR 
NLRPAWAT 


13779 


27680 


A 


13946 


420 


156 


FLP PGVKYGS FKRAP P FF FFF FFFLiVET 
LSCYIAHTGLELPDSSNYPTSASQSAVI 
TGMSHHTWPLEGACLAIPQALRILLVLV 
PLH 


13780 


27681 


A 


13947 


317 


17 


AGHGGS TCNPSQP I VTAVPINLWGRDLL 
QQWGAQAFI PEQLYSPQSQHTMQEMGYV 
PGMGLEKNLKGLKKPLQAEGQNSHQGLG 
YNFSWQPLLSLQNLYL 


13781 


27682 


A 


13948 


55 


224 


NFRLTS S FSYLKMS FWLGMWLT P VI PAL 
WEAKTGGLFEARSLRLVWWQS KKAGSHL 


13782 


27683 


A 


13949 


1464 


805 


RATS VRGAGRER S CGAAWS PAS IGPFLR 
RS VL PNAHFRS S RE GGMAAS TDMAGLE E 
S FRKFAI HGDPKASGQEMNGKNWAKLCK 
DCKVADGKSVTGTDVDIVFSKVKGKSAR 
VINYEEFKKALEELATKRFKGKSKEEAF 
DAI CQL VAGKEPANVG VTKAKTGGAVDR 
LTDT S R Y TG S HKE RFD E S GKGKG I AGRQ 
D I LDDSGYVS AYKNAGTYDAKVKK 


13783 


27684 


A 


13950 


356 


98 


NAGP G I YF WG P I KKI L P C P PAGVKLGSL 
KRAPLF F F F VE MG FHHVAQAGL KLL S S R 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine 7 I=IsoIeucine, 
K=Lysine, Ii=Leucine, M=Methionine, 
N=Asparagine, PHProIine, 
Q— Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan 7 Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














DSPIPVSQSIRITSTCEPPCLAKIKSSL 
RL 


13784 


27685 


A 


13951 


194 


48 


NFLEGWPGQRLTPVIPALWKAEAGGSLE 
PKTFRPVWAMWRDRLRKPQH 


13785 


27686 


A 


13952 


341 


418 


DQPGQHGETPS ILKIQKLAGRGSTCL 


13786 


27687 


A 


13953 


158 


323 


TWLKKYFQSTNQLPPAHLLNVQPLLPFG 
LPLLPLKKKKKKKKKKKKKKKKKKI KK 


13787 


27688 


A 


13954 


231 


355 


VFLPQKQKQNQCPLTGKWVNNSWNIHTL 
EYYSAVKGMKQLI 


13788 


27689 


A 


13955 


493 


39 


PAAFSSWGWAPGSSLLKKTTPASWPPGL, 
LPPAPVSSMAAPDLDSFPLPLCWTFLRV 
LCRVFS S STIiAFLS CVS TS FGALTPAAS 
SNIFSISLPHWASPPLPTARLTSAVSPE 
PLPPL.PSRLQRQPRGAPSHPGMSMGPIL 
DATSNSTTRSR 


13789 


27690 


A 


13956 


497 


372 


NTKISWASWYAPVIPATWEVSNSRPQAI 
HSHQPPKRIGLQA 


13790 


27691 


A 


13957 


438 


301 


GTTYLDIDMTCDYVSVCLCIYICTRQQS 
YIYTYVYTHTHTHTHMYI 


13791 


27692 


A 


13958 


84 


466 


QPLGRSGKVPQLHLWEMQRLPAAFFRSF 
AAQGLGESVHINTPTSLSLRGRPFISSS 
GPGARARVTCAPCLHAARILCCFWRGLH 
RL KCS NV I S AHCNIiRFS GS S D S PAS AS Q 
VAGITGRSSLEQLLE 


13792 


27693 


A 


13959 


216 


28 


KYVKNYVLSS 1 1 FWMQE I FHNFKKS S RT 
QWLTPVIPALWEAEAGGSPEPRSSRPAS 
SIQLENS 


13793 


27694 


A 


13960 


320 


943 


VL S F S L S DRAQR CNCRMG KQNS KLAP EV 
MEDLVKSTEFNEHELKQWYKGFLKDCPS 
GRLNLEEFQQLYVKFFPYGDASKFAQHA 
FRTFDKNGDGTIDFREFICALS ITSRGS 
FEQKLNWAFNMYDIiDGDGKI TRVEMLE I 
I E AI Y KMVGT V I MMKMNE DGLT P EQR VD 
KIFSKMDKNKDDQITLDEFKEAAKSDPS 
IVLLLQCDIQK 


13794 


27695 


A 


13961 


191 


8 


GLFRFRRLSEEVKHCCCCCCCCCCCCCC 
CCCCCCCCCCCYLRLAPQSPAAPAPEIjH 
TPLPGSRRNR 


13795 


27696 


A 


13962 


203 


28 


VWGNQHF C F VP V FRKKVF C F F P LMKLLR 
GGFFFFFFFFFLVFIDNSWVFLGEGDLA 
GS 


13796 


27697 


A 


13963 


18 


171 


GIRHEERERERERERERERERERERERH 
PARE I DM CVS KRD TRGAL CAL F 


13797 


27698 


A 


13964 


25 


299 


HDFCTRRERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERERERERERERETHTLSLYIYMCV 
FLYIPLYFSFLYIHTQREAPPPLSATLS 
LSIL 


13798 


27699 


A 


13965 


246 


410 


AYNGQGSRGVTPVIPSDWEAKTGGSLEP 
RSWRSAWATWHTP PVPGPS KE I SLYI 


13799 


27700 


A 


13966 


91 


26 | 


FCCCGCCCCCCFYCKKEKTLV 


13800 


27701 


A 


13967 


158 


24 


PFCHIYIYFLESCFVAMLECSGKILAHC 
YLRLLGSSNSLVPSIiA 


13801 


27702 


A 


13968 


129 


7 


SWAQWLMPI IPTLWEHEAGRSLEAMSSR 
PAWATQQDPVSKK 


13802 


27703 


A 


13969 


445 


268 


KIQLTKLKNASESLPSRINQAEEKXSGW 
AWCLM P VI PVLWE AKAGGS LE PRS LRRG 
QVT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I-Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


13803 


27704 


A 


13970 


1628 


1309 


HRRKKCAGYS S LS S ALLRERERERERER 
ERERESVCVCVLGTITCFRRQYCVIQGI 
SGVRQNCVQIPALSDPDLFLFLSLSVLH 
SNTWWRP TLRVTERNKGAMC V 


13804 


27705 


A 


33971 


314 


140 


QCHNCYILTQLVKIKGDQAWWLMPI I PA 
LGAKAGGSLEAGSLRSAWATWRAPMSTK 
IF 


13805 


27706 


A 


13972 


292 


482 


YKSVRTQDSCAHGVDRLVGKIRYVNLFV 
YICQHSDGRTWWLTPVIPPLWETKGGGS 
SEVGRSG 


13806 


27707 


A 


13973 


502 


303 


STAIGPSFSPFLCTHGSFHGFLFFFFCF 
CFFLVEMGSHYVAPAGLKLLGSSDPPAS 
ASPSHLGLQV 


13807 


27708 


A 


13974 


493 


289 


I P PTALGTS FS P FLCTHGS FHGFLF FFF 
CFCFFLVEMGSHYVAPAGLKLLGSSDPP 
AS ASPSHLGLQV 


13808 


27709 


A 


13975 


405 


121 


IREAAQSESLFVRAAPHTGLISGPCITE 
I YAVKDTRCMQRR I L VRE L E C AAHGL T E 
CLS IDTHTHTHTHTHTHTHTHTHTHTHT 
HGTRWEAGQSLKGES 


13809 


27710 


A 


13976 


135 


254 


TVTCTNSSWSLTFTSFLFLMPGWIQKGR 
RVCVCVCVCVC 


13810 


27711 


A 


13977 


179 


58 


VPLS INSPQRQC I FCCCCCCCCCFLVLQ 
KIISERSWAEIG 


13811 


27712 


A 


13978 


97 


186 


KI I FWLGWAHACNPSTLGGQGGRIMRG 
RD 


13812 


27713 


A 


13979 


289 


420 


ELGLKIHGFPSQAPWFMPI IPAL.WEAEA 
GGLLESRSSRIAWAMW 


13813 


27714 


A 


13980 


421 


106 


SGGRNFFFFLGGGFYKKKFPPPPFLRGK 
KKKPPLPKKKNPTPIFFPPGGGPPFSPP 
PKKGGGKIFFFFGGKNFIMQNFGGFPPL 
WG KKKKFL P KKKNPL F F F F F F 


13814 


27715 


A 


13981 


309 


135 


KHIYIYLSIYLSIYLSIYLSIYLPAYLP 
T YLHAHPL I HANAYKN I H I D Y P FKKG FE 
Y 


13815 


27716 


A 


13982 


231 


21 


QHMYCLKS I WHKKLSRKP YC Y I I SE YS S 
MVKLR VS I S Q VQ WLM P VI PAL WE AKAGG 
LLKARSSRPMWAI 


13816 


27717 


A 


13983 


209 


359 


HLHTHTHTHTPNGVKQ I MTQ I P ILPLTG 
PLILEKFLNFSKPLSHPGKRDP 


13817 


27718 


A 


13984 


391 


249 


IWPILPPQKIFFFFFETESCSVTQARVQ 
WCNLGSLQPPNPVFPSLAK 


13818 


27719 


A 


13985 


181 


403 


YLVLDSPESTLGILSQPKDYITKDQTLT 
FRP VWAHACNPS TLGS QGKW I TS VQE V 
ETSLCNTARPCLYQKYKN 


13819 


27720 


A 


13986 


441 


358 


ETGSRCVILAGAQWRDLSSVQPLPPRFK 


13820 


27721 


A 


13987 


7 


233 


AS I P C I YQKTKI PNTKQTKNKNTKITPA 
WWCTPIAPATQKAEIPPLHSSLGDFLRL 
SPKKKKGGPNWAPTPSKFF 


13821 


27722 


A 


13988 


152 


44 


PKAPLFFFFFFFFFFYETRSHSVAQAKV 
QRHDHGSL 


13822 


27723 


A 


13989 


188 


379 1 


NVS CHAAETRVW WE KERPLP VP FGAYLL 
DGMPLVWLLSASSILSIWLSIYLSIYLS 
IYLSIYF 


13823 


27724 


A 


13991 


296 


53 


SSILFHASKYHFSGRLFSLLSCKRRNIL 
QL VPHM YS P YL YMY I HTHKHI YE YMYVH 
IYIYMNLRNCRIYLHSCTPKEMQAR 


13824 


27725 


A 


13992 


37 


290 


RLGSPASRHQQIQCLIRACLKDGTLGWV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P= Proline, 
Q=Glutamine, R=Arginine, S— Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














WWLIPVIPALWEAKVGGSLEPRSSTPAW 
ATQGDLILKKDGTLLQCPHVAKGHSSHS 


13825 


27726 


A 


13993 


189 


370 


S RNA I F KKR WE S S CQ S S FNS LGVF PQAA 
TKGWVRWLTLVIPALWEAKAGGSLEPRS 
SRPA 


13826 


27727 


A 


13994 


179 


25 


SKHNKKNIQICLLCSKRSMFVHVCIHIH 
ICVYVYRYAYIYIGMCIHIYIHS 


13827 


27728 


A 


13995 


159 


2 


E D P L S P G VQDQHGHHR KS L S GRGGK I S L 
GQEFKSSMAAMYFGRSGGCKLERR 


13828 


27729 


A 


13996 


307 


99 


FIWEHFVNYTMFFTYKKPTSQAWWLMPV 
VP AL WE TE AGG S P E VR S PRP P GQHDD VR 
PLGPSLHVYLQMD 


13829 


27730 


A 


13997 


218 


21 


FQHFGRPRRVDHLS LGVQDQ PGQHGET P 
SPLASSILDLIPPPFPIDEPVDLAFQSV 
TLGVQDTTS 


13830 


27731 


A 


13998 


227 


404 


GYSVTLLLGLTQLTQRIAHGTRKGERER 
ERERERERERVCS CLMHELASHLLFFFL 
FRE 


13831 


27732 


A 


13999 


400 


668 


PVGRRWE LQGMGLGVRT WQ S S LP CQGP T 
SSSVLSLGRGTPGISNTCLPPRSATELG 
LPAQVPFPNVQSQDQDLSFSLGCFPRTY 
NRQTL 


13832 


27733 


A 


14000 


225 


414 


NRVLLCHTGWST VAI LAYCS FKLLGSRD 
S P TLAS RAART X VKCRNVWL I KKKKKKK 
KGGRRL 


13833 


27734 


A 


14001 


397 


118 


LFSGPFFFDKSECFQEPTVPFCLLLYNQ 
MQPYLLRSAFVDSYYMYDSFLRDRVSFC 
QSGWSTVAPSRLTAALNSLAQVILLPQP 
LNVLCGISR 


13834 


27735 


A 


14002 


3 


173 


YVFYRQKQMVCF YKI R I KNMFNRNRLNL 
GQWCVPIVPATQVAEAGGSPEPTSLRPI 


13835 


27736 


A 


14003 


440 


290 


YTHTL Y I Y I Y IHTHTHTHT Y I YL CT YWG 
QFDLLVLDMKLYKE I YYLSVFI 


13836 


27737 


A 


14004 


2 


116 


ARLVAMP FKYEE I KDFLLTARRKDAKS V 
KIKKNKSAAV 


13837 


27738 


A 


14005 


279 


422 


S AKAPLS CLRTNS LLLKNGLRGWAW WI T 
PVIPTLWEAEVGGSPDVTSP 


13838 


27739 


A 


14006 


3 


497 


GGIGDSRCGSTRASSSPQLAGRSSSVLP 
AAAQPCTPTMDVFKKGFSIAKEGWGAV 
E KTKQGVTEAAE KTKEGVM YVGAKTKEN 
WQS VTS VAE KT KE QANAVS EAWS S VN 
TVATKTVEEAENIAVTSGWRKEDLRPS 
APQQEGEASKEKEEVAEEAQSGGD 


13839 


27740 


A 


14007 


1 


292 


SLGGGGCNDPRSYHCTPAWATEQDTVSL 
SLSIYMSVRVCIYMYMCVCVCIRIYTYI 
HTHIHIRVYIYTIPYIWGIYIYIYVDLV 
IYMGWCGCPPPSI 


13840 


27741 


A 


14008 


57 


250 


KI FFFLMNHTWGGVPWLATWEAEVGGS 
LE PRS LGLQETMMTS LLS RLPRQQGE I L 
SLILKRNY 


13841 


27742 


A 


14009 


184 


2 


AVPGRWPCQQVPGLLPS IQDTLKQGRTG 
QWLTPVTPMPWEAEAGGSLDARSLRPAW 
ATGK 


13842 


27743 


A 


14010 


123 


3 


KEPSRPGRWLTSI IPAGWEAEAGGPPEA 
RSSTREFRTAK 


13843 


27744 


A 


14011 


403 


168 


GTGLRASRCISQPVMSGLLSPSEFPFPE 
I ESHLHVS QRETAARLRDRERERARERE 
RERERGKNAS S GKP PGS AS C VP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cyste'me^ 0=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine 7 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=JLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S-Serine, 
l=lnreomne, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


13844 


27745 


A 


14012 


207 




E I RRG S LAGPL S LNF I L S VMGS HS K YLR 
KVIGRAQWLMPWPALWEAEVGKLHELR 
RLGHTWAAW 


13845 


27746 


A 


14013 


200 


405 


WLCVLGWS LLALS PRLEC YGP I S AHCNL 
HLLRSSDSPASASQALQEAWYQHLLLMM 
PQAAS TQVGRKK 


13846 


27747 


A 


14014 


319 


176 


KPSTITFLFLHLFISSRESRPVTQXEVK " 
WLNHSS PQPPTPRFSLPSS 


13847 


27748 


A 


14015 


63 


401 


E Q ENKNTLNS P GWDGVH I HVLR FLNTRM 
PLGHLS FS EGVSGS GSSGRRCQAGAGGR 
SATAAARPCQCLGLKGLPPAVPSCGLEG 
KAPERAGYSASHPPHGCETGNGWVLYVN 


13848 


27749 


A 




/lift 


63 


VSKVYPKINHTQRTVRHETFKIAMPKYY 
SNLCSILRFNKDRLAFMRHYTKQCSRTS 
ITEYIAMFFFVCL.FVLKIGAHCHPGWNE 
WL VCS Q VTAAS TS RAQAI LPPQPPKIL 
GMQQA 


13849 


27750 


A 


14017 


1 Qft 


425 


SLMFRPPLFSFCCFCIRYPFIFVFFSLL 
CFAFSWCSLLLFFFHYLKLFLLLFLLFF 
LPVLLiCLDSFIjVHDFFLLSS 


13850 


27751 


A 


14018 


3 


233 


SSRMGRGTKTIAFDMRWS GEVDRARHRD 
REREREREREREREKGAEPQEFPVISPW 
V UK F LiAG J. JjHIjK CG CGHRL 


13851 


27752 


A 


14019 


163 


i 
i 


.BJiK w tiJUy JL C V JSJttJB 1 WGWVQWLMPVI PVIj 
WE AKAKNLLE P KNL KLQRNRRP GNS R 


13852 


27753 


A 


14020 


203 




IjAAKKJNI K.S QAQW Jj'i'S WMS QSAEAGG S 
LEPRNSRPAWAT 


13853 


27754 


A 


14021 


179 


j 


VTWQRELRLQMELRLLICRAWWLTPVI P 
VP WEAKAGGS PE PR S S R PAWETRNRR P G 

■Wo 


13854 


27755 


A 


14022 


358 


431 


WHKKKNQNWLGTGAHACNPSTLGG 


13855 


27756 


A 


14023 


98 


3 


GRPAGWLTPVAPTLWEAEMGGSIiEPRSL "" 
GHE 


13856 


27757 


A 


14024 


9^4 


1 Or 


RENWAE P S SHSLLN I FLQLGTVARACNP 
STLGGRGGWITRSGD 


13857 


27758 


A 


14025 


140 


1 


KGVLDGPRGLCVFCFCFFFLCVCVCVCV 
CVCVWRRSLTLSPRLQCI 


13858 


27759 


A 


14026 


579 


852 


QREWVGWAGKEGEGWVSHVPASAQLPAN 
GQRGQPHPSPLGGTGSTWAQEGAYCCLS 
SCSHCCCCSSCCCCCPPGFCLFPSLGAI 
CHLYHL 


13859 


27760 


A 


1 A-fYlH 
I H-Ui / 




663 


LRIRALRELPASHIPGSLTICCVPRPPL 
P CS S T KPDAKG YKP LAQ QKLGS QR TGP C 
LGHAPCYS PLWELRGHGCS PYGPS PSGS 
VS L WQE AMRL P KNT PEE KDRRTAALQ E G 
LRRAVSVPLTIiAETVASLWPALQELARC 
GNLACRS DLQVAAKALEMGVFGAYFNVL 
INIiRDITDEAFKDQIHHRVS SLLQEAKT 
QAALVI/DCIiETRQE 


13860 


27761 


A 


14028 


1 


267 


AVGVHHAFHLPHCFFASLLES P VS PRLA 
MDPNCS CAAGVS CTCAGSCKCKECKCTS 
CKKSCCSCCPVGCSKCAQGCVCKGASEK 
CSCCD 


13H61 


27762 


A 


14029 


46 


1746 

1 


PAAGAATMEFRQEE FRKLAGRALGKLHR 
LLEKRQEGAETLELSADGRPVTTQTRDP 
PWDCTCFGIiPRRYI IAIMSGLGFCISF 
GIRCNLGVAIVSMVNNSTTHRGGHVWQ 
PCAQFS WD PETVGL IHGS FFWGYI VTQ I P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=L,ysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














GGF I CQKFAANRVPGFAI VAT S TLNML I 
PSAARVHYGCVI FVRILQGLVEGVT YPA 
CHG I WS KWAP PLERS RLATTAFCGS YAG 
AWAMPLAGVLVQYSGWSSVFYVYGS FG 
I FWYLFWLLVS YES PALHPS I SEEERKY 
IEDAIGESAKLMNPLTKFSTPWRRFFTS 
MPVYAIIVANFCRSWTFYLLLISQPAYF 
EEVFGFE IS KVGLVSALPHLVMTI I VP I 
GGQIADFDRSRRIMSTTNVRKLMNCGGF 
GMEATLLLWGYSHSKGVAI S FLVLAVG 
FSGFAI SGFISTVNHIjDIAPR YAS I LMGI S 
NGVGTLSGMVCPI IVGAMTKHKTREEWQ 
YVFL IAS LVH YGGVI FYGVFASGEKQ PW 
AEPEEMSEEKCGFVGHDQLAGSDDSEME 
DEAEPPGAPPAPPPSYGATHSTFQPPRP 
PPPVRDY 


13862 


27763 


A 


14030 


240 


63 


VFTAQSNGGVELNRFNEGPLSAVLKGRV 
AWAQWLTPVI PTL WE VKAGGLLE ARS LS 
PA 


13863 


27764 


A 


14031 


289 


414 


HAEMGRVQWLTPAI PALCKAEAGGP PEP 
RSLRPAWATY 


13864 


27765 


A 


14032 


386 


1765 


LGDARAP E KMS A I QAAW P S G T E C I AKYN 
FHGTAEQDLPFCKGDVLTIVAVTKDPNW 
YKAKNKVGREGI I PANYVQKREGVKAGT 
KLSLMPWFHGKITREQAERLLYPPETGL 
FLVRESTNYPGDYTLCVSCDGKVEHYRI 
MYHASKLSIDEEVYFENLMQLVEHYTSD 
ADGLCTRLIKPKVMEGTVAAQDEFYRSG 
WALNMKELKLLQT IGKGE FGDVMLGDYR 
GNKVAVKCI KNDATAQAFLAEAS VMTQIi 
RHSNLVQLLGVIVEEKGGLYIVTEYMAK 
GSLVDYLRSRGRSVLGGDCLLKFSLDVC 
EAMEYLEGNNFVHRDLAARNVLVSEDNV 
AKVSDFGLTKEASSTQDTGKLPVKWTAP 
EALREKKFSTKSDVWSFGILLWEIYSFG 
RVPYPRIPLiKDWPRVEKGYKMDAPDGC 
PPAVYEVMKNCWHLDAAMRPSFLQLREQ 
LEHIKTHELHIi 


13865 


27766 


A 


14033 


619 


414 


EQARCLEQHACTSPRQPRPQLQCSELKG 
HSLGMEHQGHVP CLHPCS P CP P VKAS PA 
CWSSNPAPLCPH 


13866 


27767 


A 


14034 


2 


614 


IiESRPDGRPSSTHPASPSAFSAPGKPHP 
PEAKMS S KRAKAKTTKKRPQRATSNVFA 
MFDQSQIQEFKEAFNMIDQNRDGFIDKE 
D LiHDMLAS LGKNP TDE YLEGMMS E APG P 
INFTMFLTMFGEKLNGTDPEDVIRNAFA 
CFDEEASGFIHEDHLRELLTTMGDRFTD 
EEVDEMYREAP IDKKGNFNYVEFTRILK 
HGAKDKDD 


13867 


27768 


A 


14035 


1 


254 


AVEFGPAGPGSLGRAAAMIIPVRCFTCG 
KIVGNKWEAYLGLLQAEYTEGDALDALG 
LKR Y C CRRMUjLAHVDL I E KLLN YAPLE K 


13868 


27769 


A 


14036 


494 


148 


FRFL S D CGVFAEGH I E LQ VE S GVP LGF S 
TMAEDME T KI KN YKTAP FDS R F PNQNQT 
RNCWQNYLDFHRCQKAMTAKGGD I S VCE 
WYQRVYQSLCPTSWVTDWDEQRAEGTFP 
GKI 


13869 


27770 


A 


14037 


130 


3969 


I MGDVKNFL YAWCGKRKMT PS YE I RAVG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ IB 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NKNRQKFMCEVQVEGYNYTGMGNS TNKK 
DAQSNAARDFVNYLVRINE I KSEEVPAF 
GVASPPPLTDTPDTTANAEGDLPTTMGG 
PLPPHLALKAENNSEVGASGYGVPGPTW 
DRGANLKDYYSRKEEQEVQATLESEEVD 
LNAGLHGNWTLENAKARLNQYFQKEKIQ 
GEYKYTQVGPDHNRSFIAEMTIYIKQLG 
RRIFAREHGSNKKLAAQSCALSLVRQLY 
HLG WE AYS GLTKKKEGETVE P YKVNLS 
QDLEHQLQNI I QELNLE ILP P PEDPS VP 
VALNIGKLAQFEPSQRQNQVGWPWSPP 
QSNWNPWTSSNIDEGPLAFATPEQISMD 
LKNELMYQLEQDHDLQAILQERELLPVK 
KFE S E I LE AI S QNSWI I RGATG CGKTT 
Q VP Q F I LDD F I QNDRAAE CN I WTQPRR 
ISAVSVAERVAFERGEEPGKSCGYSVRF 
ESVLPRPHASIMFCTVGVLLRKLEAGIR 
GISHVIVDEIHERDINTSFLLWLRDW 
QAYPEVRIVFMSATIDTSMFCEYFFNCP 
SLKLWRTYPVQEYFLEDCIQMTHFVPPP 
KDKKKKDKDDDGGEDDDANCNL I CGDEY 
GPETRLSMSQLNEKETPFELIEALLKYI 
ETLNVPGAVLVFLPGWNLIYTMQKHLEM 
NPHFGSHRYQ I LP LHS QI PREEQRKVFD 
P VP VGVTKVI LS TN IAETS I TINDWYV 
IDS CKQKVKLFTAHNNMTNYS TVWAS KT 
NLEQRKGRAGRSTAGFCFHLCSRARFER 
LETHMTPEMFRTPLHEIALSIKLLRLGG 
IGQFLAKAIEPPPLDAVIEAEHTLRELD 
ALDANDELTPLGR I LAKLP I E PRFGKMM 
IMGC I FYVGDAI CT I AAATCFPE PFVNE 
GKQLGYIHRNFAGNRFSDHVALLSVFQA 
WDDARMGGE E AE I R F CEHKRLNMATLRM 
TWEAKVQLKEILINSGFPEDCLLTQVFT 
NTGPDNNLDWISLLAFGVYPNVCYHKE 
KRKILTTEGRNALIHKSSVNCPFSSQDM 
NYPS P FFVFGE KI RTRAI S AKGMTLVP P 
LQLLLFASKKVQSDGQIVLVDDWIKLQI 
SHE AAAC I TGLRAAME ALWE VTKQ P AI 
I S QLDP VNERMLNM I RQ I SE P S AAG INL 
MIGSTRYGDGPRPPKMARYDNGSGYRRG 
GS SYSGGGYGGGYSSGGYGSGGYGGSAT 
PSGRI CAG VGGG YRGVS RGGFRG3STS GGD 
YRGP S GG YRGSGG FQRGGGRGAYGTG YL 
D I EE E VAAI KLG YVS S VCRQ 


13870 


27771 


A 


14038 


431 


542 


EGITPGWARWLTPVI PGLWEEEAGGSPG 
REIDIGLAN 


13871 


27772 


A 


14040 


238 


362 


RR CT F L S FRMLAP R V Y S WGKRAFS P S V 
CVRAHGKCDYSYS 


13872 


27773 


A 


14041 


6290 


3514 


FRAAGSSSTNSGRICPLPSGALLYQSEG 

LIjARPHGKGS fqvgrqqhqavrvgs aha 

S S AG Y VC VS AFP F VFLRF C VGGGP G I S R 
VYALFYGE CNPTRE WAVS S ELS P S FQEQ 
NKMNKVEQKSQES VS FKDVTVGFTQEEW 
QHLDPSQRALYRDVMLENYSNLVSVGYC 
VHKPEVIFRLQQGEEPWKQEEEFPSQSF 
PEVWTADHLKERSQENQSKHLWEWFIN 
NEMLTKEQGDVIG I P FNVDVS SFP SRKM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first * 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gfutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, t=Isofeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














FCQCDS CGMS FNTVSEtiVI S KINYLGKK 
SDEFNACGKLLLNIKHDETHTREKNEVL 
KNRNTLSHRENTLQHEKIQTLDHNFEYS 
I CQETLLEKAVFNTRKRENAEENNCDYN 
EFGRTFCDSSSLLFHQISPSRDNHYEFS 
DCEKFLCVKSTLSKPHGVSMKHYDCGES 
GNNFRRKLCLSHLQKGDKGEKHFECNEC 
GKAFWEKSHLTRHQRVHTGQKPFQCNEC 
EKAFVJD KSNLTKHQRSHTGE KP FE CNE C 
GKAFSHKSALTLHQRTHTGEKPYQCNAC 
GETFYQKSDLTKHQRTHTGQKPYECYEC 
GKS FCMNSHLT VHQRTHTGE KP FE CLE C 
GKSFCQKSHLTQHQRTHIGDKPYECNAC 
GKT F YH KS VLTRHQ 1 1 HTGL KP YE C YE C 
GKT FCLKS DLT I HQRTHTGE KP FACPE C 
GKFFSHKSTLSQHYRTHTGEKPYECHEC 
GKIFYNKSYLTKHNRTHTGEKPYECNEC 
GKTFCQKSQLTQHQRIHIGEKPYECNEC 
GKAFCHKSALIVHQRTHTQEKPYKCNEC 
GKS FC VKS GL I FHE RKHTGE KP YE CNE C 
GKF FRHKS SLTVHHRAHTGE KS CQ CNE C 
GKI F YRKS ELAQHQRSHTGEKP YE CNTC 
RKTFSQKSNLIVHQRRHIGENLMNEMDI 
RNFQPQVSLHNASEYSHCGESPDDILNV 
Q 


13873 


27774 


A 


14043 


187 


422 


LTIPKLHSAQEPAGPVSPPPPPSPTFQI 
GGRSRAEPRNQYCPQVAPVPALRGCLPL 
SPGEPPHDTSSPRHLTCGGIVF 


13874 


27775 


A 


14044 


340 


442 


HKFLSWVRLLjMPVI PTFWEAKYGGWLEA 
RSLRMQ 


13875 


27776 


A 


14045 


3 


1240 


LVEGAAGQGVSDGARLRKCGTRS FPGSE 
EVLSSMARGSAFPAAALWLWS ILLCLLA 
L RAE AGP P QEE S L YL W I DAHQARVL IGF 
EEDILIVSEGKMAPFTHDFRKAQQRMPA 
ISVNIHSMNFTWQAAGQAEYFYEFLSLR 
SLDKG I MAD PTVNVPLLGTVPHKAS WQ 
VGFPCLGKQDGVAAFEVDVIVMNSEGNT 
ILQTPQNAI FFKT CQQ AEC P GGCRNGG F 
CNERRI CECPDGFHGPHCEKALCTPRCM 
MGGLCVTPGFCICPPGFYGVNCDKANCS 
TTCFNGGTCFYPGKCIGPPGLEGEQCEI 
S KCPQ P CRNGGKC I GKS KCKCS KGYQGD 
LCSKPVCEPGCGAHGTCHEPNKCQCQEG 
WHGRHCNKR YEAS L IHALRPAGAQLRQH 
TPSLKKAEERRDPPESNYIW 


13876 


27777 


A 


14046 


18 


274 


YSQE VLCEMNRSGWSNI KRGWAWWQLT 
TTIPALWEAEVGGLLEPRSLRLASPQPP 
QHEDYSMSYCAQPRFSLCIRGFIFYSLW 
F 


13877 


27778 


A 


14047 


177 


441 


GGSH YAWPN I TF I FTKL I AQ FFC F FEMQ 
SCSVARLECRGAISAHCNLHLPGSSNSP 
VMAQILKNQRKSNFDEDATLLGLSYTLL 
GEI 


13878 


27779 


A 


14048 


136 


319 


D VFLLL FHSN YFL F FE TGS HS VS QAG VK 
WCDLSSLQPPPLGSSDPPTSASRVPALF 
FFLEF 


13879 


27780 


A 


14049 


388 


128 


ELKLPSYKGQSPQLSLRRYFADLIAIVS 
WRFTLCP S ARHLAVYLLDLiFMDRYD I S I 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=GIutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Iso leu cine, 








26 


correspond 
ingto first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














QQLHLVALSCLLLASKYESDLHDWKFLM 
FI 


13880 


27781 


A 


14050 


262 


3 


VI YNY I S VRPF PNI YHT F YTLS LFL YLS 
THTHTHTHTHTHTQRETGHVCVCIYTYM 
YMCM YAKWW YKFT FL I SGKAR I HTHLHP 
CI 


13881 


27782 


A 


14051 


149 


423 


YPHLCAFLLLPSISSFRAGVFFFSYLSL 
TAYPLTWKPPEPPATPQPTPNPPPGHPA 
TRG P KS AL LDH S YACASAP P LGT EGTAP 
SPSFALI 


13882 


27783 


A 


14052 


2 


1440 


FVAVTAMAAPCLLRQGRAGALKTMLQEA 
QVFRGIiASTVSLSAESGKSEKGQPQNSK 
KQSPPKNWEPKERGKLLATQTAAELSK 
NLSS PS S YPPAVNKGRKVAS PS PSGSVL 
FTDEGVPKFLSRKTLVEFPQKVLSPFRK 
QGSDS EARQVGRKVTS PS S SSSSSSSDS 
E S DD E AD VS E VT P R WS KGRGGLRKP E A 
SHSFENRAFRVTVSAKEKTLLQKPHVD I 
TDPEKPHQ P KKKGS PAKP S EGRENARP K 
TTMPRS QVDEE FLKQSLKEKQLQKTFRL 
NEIDKESQKPFEVKGPLPVHTKSGLSAP 
P KGS PAPAVIiAEEARAEGQLQAS PPGAA 
EGHLEKPVPEPQRKAAPPLPRKETSGTQ 
G I EGHLKGGQAI VEDQI PPSNLETVPVE 
NNHGFHEKTAALKLEAEGEAMEDAAAPG 
NDRGGTQEPAPVPAEPFDNTTYKNLQHH 
DYSTYTFLDLNLELSKFRMPQPSSGRES 
PRH 


13883 


27784 


A 


14053 


177 


1253 


EKKRTLP VRS VTGE LQGKS LS DLAAGTM 
DSEKKRFTEEATKYFRERVSPVHLQILL 
TNNE AW KR F VTAAE L PRDE ADAL YE AL K 
KLRTYAAIEDEYVQQKDEQFREWFLKEF 
PQVKRKIQES IEKLRALANGIEEVHRGC 
TISNWSSSTGAASGIMSLAGLVLAPFT 
AGTS LALTAAGVGLGAAS AVTG I TTS IV 
EHS YTS S AEAEASRLTATS IDRLKVFKE 
VMRDITPNLLSLLNNYYEATQTIGSEIR 
AIRQARARARLPVTTWRISAGSGGQAER 
TIAGTTRAVSRGAR I LSATTSGI FLALD 
WNLVYESKHLHEGAKSASAEELRRQAQ 
ELEENLMELTQIYQRLNPCHTH 


13884 


27785 


A 


14054 


1585 


2099 


ICVKTFPPLALQVRMAAAEHHHSSGLPY 
W P YLTAE TL ECNRMGHQ PPPPTQQHSI TD 
NSLSLKTPSERLLYPLPPSAPPSADDNL 
KTPPECLLTPLPPSALPSADDNLKTPAE 
CLLYPLPPSADDNLKTPPECLFTPLPPS 
AP PS VDDNLKTP PE C VCSL P FHPQRM 1 1 
SRN 


13885 


27786 


A 


14055 


2 


2865 


ALPDGGAS VASDRAEGRPAKP S KTAARE 
KTEGAVAAVGGGP S S FRCCYGCCHEARL 
GRTS LPRGVI MLTE AS LSI WGWGS LG I V 
LFLITFGPFVIFYLTFYILCFVGGGLW 
TLLFGKTNSEKYLEQCEHSFLPPTSPGV 
PKCLEEMKREARTIKIDRRLTGANIIDE 
PLQQVIQFSLRDYVQYWYYTLSDDESFL 
LEIRQTLQNALIQFATRSKEIDWQPYFT 
TR I VDD FGTHLRVFRKAQQ KITE KDDQ V 
KGTAEDLVDTFFEVEVEMEKEVCRDLVC 
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NO: of 
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NO: of 
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M 
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od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenylaIanine, 
G=Glycine> H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TSPKDEEGFLRDLCEVLLYLLLPPGDFQ 
NKIMRYFVRE ILARGILLPLINQLSDPD 
YINQYVXWMIRDSNCNyEAFMWIIKLSD 
NIGELEAVRDKAAEELQYLRSLDTAGDD 
INTIICNQINSLLFVKKVGDSRIQRLQSG 
KE INTVKLAANFGKLCTVPLDS ILVDNV 
ALQFFMDYMQQTGGQAHLFFWMTVEGYR 
VTAQQQLiEVLLSRQRDGKHQTNQTKGLL 
RAAAVGIYEQYLSEKASPRVTVDDYLVA 
KLADTIiNHEDPTPEIFDDIQRKVYELML 
RDERFYPSFRQNALYVRMLAELDMLKDP 
S FRGS DDGDGE S FNGSPTGS INLSLDDL 
SNVSSDDSVQLHAYISDTVYADYDPYAV 
AGVCNDHGKT YAL YAI T VHRRNLNS E EM 
WKT YRR YSDFHD FHMR I TEQ FE SLS S I L 
KL P E KKT FNNMDRD FLE KR KKDLNAYL Q 
LLLAPEMMKASPALAHYVYDFIiENKAYS 
KGKGD FARKMD T F VNPLRNS MRNVS NAV 
KSLPDSLAEGMTKMSDNMGKMSERLGQD 
IKQSFFKVPPLIPKTDSDPEHRRVSAQL 
DDNVDDNIPLRVMLLiLMDEVFDLKERNQ 
WLRRNIKNLLQQLIRATYGDTINRKIVD 
HVDWMTS PEQVADS VKRFRDAFWPNGIL 
AEAVP CRDKS I RMRTRVAG KTKLLA IMP 
GE 


13886 


27787 


A 


14057 


311 


150 


FLCFKYRRGFATLHSGLKLLGSSDPPSS 
TSQIPGI TGTSHC VQPTHLFFL ALS 


13887 


27788 


A 


14058 


53 


211 


RQH I T CLD I F F I HMNYKYKY 1M I YL S I Y 
LSIYLSIYLIYLSISHSSYITERA 


13888 


27789 


A 


14059 


1 


1642 


RDGRKJYIATATIALQVNGQQGGGSEPAAA 
AAWAAGDKWKP PQGTDS I KMENGQS TA 
AKLGLPPLTPEQQEALQKAKKYAMEQS I 
KS VLVKQT I AHQQQQLTNLQMAAQRQRA 
LAIMCRVYVGSIYYELGEDTIRQAFAPF 
GPIKS IDMSWDS VTMKHKGFAFVEYEVP 
EAAQIiALEQMNSVMLGGRNIKVGRPSNI 
GQAQP I IDQLAEEARAFKRI YVASVHQD 
LSDDD I KS VFEAFGKI KS CTLARDPTTG 
KHKG YG F I E YE KAQ S S QDAVS S MNL FD L 
GGQYLRVGKAVTPPMPLLTPATPGGLPP 
AAAVAAAAATAKI TAQEAVAGAAVLGTL 
GTPGL VS P ALT LAQ P LGTL P QAVMAAQA 
PGVITGVTPARP P I PVTI PSVGWNP I L 
ASPPTLGLLEPKKEKEEEELFPESERPE 
MLSEQEHMSISGSSARHMVMQKLIiRKQE 
STVMVLRNMVDPKDIDDDLEGEVTEECG 
KFGAVNRVI I YQEKQGEEEDAE I I VKI F 
VEFS IAS E THKA I QALNGRWFAGRKWA 
EVYDQERFDNSDLSA 


13889 


27790 


A 


14060 


1238 


1474 


VLALQEGRPWRREPASIDACRLNFQRLR 
RGKFSNVLFPGLAQEALYS GGYHLKFAD 
ELMGGNLKKSTADASGSRGHQIi 


13890 


27791 


A 


14061 


266 


400 


GQWARPVIPALWEAKAGGSLEPSMLRPA 
GQHIKTPSVLITSKKT 


13891 


27792 


A 


14062 


2178 


1881 


VLQAPS ILLDAPRTDGGDMGRAMVARLG 
LWLLLIiALLLPTQIYSSETTTGTSSNSS 
QSTSNTGLAPNPTNATTKAAGGALQSTA 
SLFWSLSLLHLYS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "—Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


13892 


27793 


A 


14063 


261 


1 


NS VLTLALLFS LHVHLPP P P S GVTDTAG 
ALS TGGP P S P T S VTGRGP AH S HAS QL P P 
APGE FAPLNE S CRGWAGE AFLERFPDAW 
VDP 


13893 


27794 


A 


14064 


2384 

* 


1115 


QHFSRRGLCWEQRSSVTSSWTSGAWSP 
P CP P SNAS CNTLHTRDWAS PDPGGQGSL 
GESPGPAPPGQLHTLDTDLHSLAQIGGK 
SPVAGVGNGGSLWPRESPGTANGHSPEH 
TPPGPGPPGPCPTKRRLLPAGEAPDVSS 
EEEGPAPRRRRGSLGHPTAANSSDAKAT 
PFWSHLLPGPKEPVLDPTDCGPMGRRLK 
GARRLKLSPLRSLRKGPGLLSPPRASPV 
PTPAVSRTLLGNFEESLLRGRFAPSGHI 
EGFTAEIGASGSYCPQHVTLPVTVTFFD 
VSEQNAPAPFLGIVDLNPLGRKGYSVPK 
VGT VQ VTLi FNPNQ T WKM FLVT FD FSDM 
PAAHMTFLRHRLFLVPVGEEGNANPTHR 
LLCYLLHLRFRSSRSGRLSLHGDIRLLF 
SRRSLELDTGLPYELQAVTEAPHNPRYS 
PLP 


13894 


27795 


A 


14065 


232 


416 


GWAVQLGIGTEGKEGTSSSERQREGRGD 
GNR KTGRRREGRR S T KTDAP S P YP AHR P 
RSKLI 


13895 


27796 


A 


14067 


238 


45 


IQTCGSHSFCIPSIWEAKAGGLLEPRSL 
SPACPTQKDPISHLGSELPQSFRLLIRL 
VPLFLSAC 


13896 


27797 


A 


14068 


3 


1705 


SCESKATPWRAVSASQELQHPQGGQRSP 
LPGDLTPARPNPAYPLT VE CQRS CSRP V 
PAPLPPHTHPPGSSCFYSSFSFITKATA 
PGAQRRAVTQAERGRMGFLGTGTWILVL 
VLPIQAFPKPGGSQDKSLHNRELSAERP 
LNEQ I AEAEEDKI KKT YPPENKPGQSNY 
SFVDNLNLLKAITEKEKIEKERQSIRSS 
PLDNKLNVEDVDSTKNRKLIDDYDSTKS 
GLDHKFQDDPDGLHQLDGTPLTAEDIVH 
KI AAR I YEENDRAVFDKI VS KLLNLGL I 
TESQAHTLEDEVAEVLQKLISKEAMNYE 
EDPNKPTSWTENQAGKIPEKVTPMAAIQ 
DGLAKGENDETVSNTLTLTNGLERRTKT 
YSEDNFEELQYFPNFYALLKS IDSEKEA 
KEKETL I T I MKTL I DFVKMMVKYGT I S P 
EEGVSYLENLDEMIALQTKNKLEKNATD 
NISKLFPAPSEKSHEETDSTKEEAAKME 
KE YGS L KD S T KDDNS NPGGKTD E P KGKT 
EAYLEAI RKNI E WLKKHDKKGNKEDYDL 
S KMRDF INKQADAYVE KGI LDKEEAEAI 
KRIYSSL 


13897 


27798 


A 


14069 


1 


140 


NTSVAIQTIEYYSAFKRKETLTHVPLWM 
NLKDTMLRE I SQSQKDTV 


13898 


27799 


A 


14071 


212 


104 


HTNHQ CQTVLPAGQALAT PQGLAPS PVF 
PQSCLRMV 


13899 


27800 


A 


14072 


358 


112 


FPLTVMLNVFLWWQNLKHFCWFIQEQLM 
FFFFFFLRTGSHYWQAGLELLGSTNLP 
ASVSRVAGTTGIHNCTQFNPLHTLHL 


13900 


27801 


A 


14073 


3 


151 


YMGFHHVGYAGLELLTSSDLPPWPPKCW 
DYRHE PS CLAMFF YFAL I APE 


13901 


27802 


A 


14074 


8 


1493 


VT I HHL F VGQ AVRADTLKKKS P S CLL C I 
VFIPVPRIDLTNIDQVAVIFKHHFPVGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GDAVLKTWAPAQCLCSRMGPAWLWLLGT 
GILASVHCQPLIAHGDKSLQGPQPPRHQ 
LS EPAPAYHRITP T I TNFALRLYKELAA 
DAPGNI FFS P VS I STTLALLSLGAQANT 
SALILEGLGFNLTETPEADIHQGFRSLL 
HTLALPSPKLELKVGNSLFLDKRLKPRQ 
HYLDS I KE LYGAFAF S ANFTD S VTTGRQ 
INDYLRRQTYGQWDCLPEFSQDTFMVL 
ANYIFFKAKWKHPFSRYQTQKQESFFVD 
E RTS L Q VPMMHQ KEMHRFLYD QDLiACT V 
LQ I E YRGNALALL VL PDPGKMKQ VE AAL 
QPQTLRKWGQLLLPSLLDLHLPRFSISG 
TYNLEDILPQIGLTNILNLEADFSGVTG 
QLNKT I S KVSHKAMVDMSE KGTEAGAAS 
GLLSQPPSLNTMSDPHAHFNRPFLLLLW 
EVTTQSLLFLGKWNPVAG 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-13901 , a mature protein coding portion of SEQ ID NO: 1-13901 , 
an active domain of SEQ ID NO: 1-13901 , and complementary sequences thereof. 

5 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

10 3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1 . 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

15 5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 
20 7. An expression vector comprising the polynucleotide of claim 1. 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
25 associated with a regulatory sequence that modulates expression of the polynucleotide in the host 

cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 
30 (b) a polypeptide encoded by a polynucleotide hybridizing under stringent 

conditions with any one of SEQ ID NOS: 1 -1 390 1 . 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 
35 12. An antibody directed against the polypeptide of claim 10. 
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13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 
5 b) detecting the complex, so that if a complex is detected, the polynucleotide 

of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
10 nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

15 sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

20 16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

25 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 10 is identified. 

1 8. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10 3 in a cell, tinder 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
5 that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 

of claim 10 is identified 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected from 
1 0 the group consisting of a polynucleotide sequence of SEQ ID NO: 1-13901 , a mature protein 
coding portion of SEQ ID NO: 1-13901 , an active domain of SEQ ID NO: 1-13901 , 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-13901 , under conditions sufficient to express the polypeptide in 
said cell; and 

15 b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 13902-27802, the mature protein portion 
thereof, or the active domain thereof. 

20 

21 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprises the 
25 sequence information of at least one of SEQ ID NOS: 1-1390L 

23. The collection of claim 22, wherein the collection is provided on a nucleic 
acid array. 

30 24. The collection of claim 23, wherein the array detects full-matches to any 

one of the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any 

one of the polynucleotides in the collection. 

35 
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26. The collection of claim 22, wherein the collection is provided in a 
computer-readable format. 

27. A method of treatment comprising administering to a mammalian subject in need 

thereof a therapeutic amount of a composition comprising a polypeptide of claim 
1 0 or 20 and a pharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 

thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically 
acceptable carrier. 
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